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PREFACE 


Learning occupies one of the central positions in psychology, and an under- 
standing of the psychology of learning should be useful to almost every 
serious student of psychology. The aim of the first edition of this book was 
to bring together for the student the divergent interests in the psychology 
of learning. This, likewise, is the aim of the new edition. In other respects, 
however, the two editions are very different. The first edition was a survey; 
this edition is less so. The present version treats fewer separate topics and 
treats these in greater detail. It is more theoretically directed than was the 
first edition and more selective. The more intensive coverage is achieved 
by eliminating topics that are of little or no fundamental importance and 
by eliminating peripheral topics which are generally more adequately 
covered in other courses. As before, however, the author has strenuously 
tried to resist the inclination to ride special hobbyhorses; the material is 
selected, but it is selected in such a way as to give the student as representa- 
tive a picture as possible of the contemporary concern with the psychology 
of learning. 


As in the first edition, the early chapters attempt to cover the basic prob- 
lems of conditioning and learning. The material in these chapters deals with 
the acquisition of individual responses, and there is litde attention given in 
these chapters to the inte raction bet wgfiiL j-esponses. The basic controlling 
variables — reinforcement, dme, freqi^ncy, etc. — are discussed in these 
chapters. The middle chapters provide a'bridge to problems in learning which 
center around interactions of learned responses. Chaining, or serial learn- 
ing, and transfer of training are examples of the topics covered. The later 


chapters deal with retention, problem solving, and the other processes under 
the control of implicit and mediating activity. . i 4^ 

The approach of the book isTargely empirical, but it has been 
(and undesirable) to avoid a theoretical bias, however slight. It is ti^ 
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detect, and instructors will not find it difficult to present somewhat different 
theoretical positions without interference from the book. To the informed 
reader the influence of B. F. Skinner, E. C. Tolman, and the late C. L. Hull 
is obvious in the early chapters. Perhaps less obvious is the fact that the 
author is considerably indebted for the point of view in the later chapters 
to G. A. Miller, C. N. Cofer, C. E. Osgood, and others who have recently 
worked so intensively on the role of coding and mediating activity in the 
human intellectual processes. 

Again, as was the case in the first edition, there has been some attempt 
to compromise between what experimental psychologists find fruitful (or 
easy) to investigate in the laboratory and what students see as more funda- 
mental problems. Although there has been very little effort to treat “ap- 
plied problems as such, the relation of basic concepts in the psychology 
of learning to things outside the rat laboratory and the memory-drum cubi- 
cle is stressed throughout the book. The author has the fond hope that one 
day the psychology of learning will return to its original position as one of 
the basic sciences underlying educational psychology and similar applied 

fields. There is no doubt that some sections of this book have been influ- 
enced by that hope. 

Although the book is principally addressed to the junior or senior stu- 
dent in psychology, graduate students in psychology and in educational 
psychology will find it useful. The book is relatively self-sufficient and can 
be studied with profit even by a student with only the slightest background 
m psychology. It is the author’s hope, at the same time, that it will provide 
a foundation for further study, particularly the study of theoretical issues 
which can only be understood with some background in the basic empirical 
problems in the psychology of learning. 

Numerous people have read all or portions of the manuscript. To these 
individuals the author gives his thanks. Many of the suggestions made by 
readers have been incorporated into the book; in other cases the author, 
perhaps rashly, has decided to let matters stand — partly because he wanted 
the book to reflect the contemporary scene in the psychology of learning as 
faithfully as possible. Special thanks are due to the author’s wife, Ellin 
Deese, who carefully read many versions of the manuscript and who typed 
the larger portion of the final draft. Also, a special word of thanks is due 

to Clifford T. Morgan, whose very considerable editorial talents include the 
ability to teach others some of the skills of writing. 

Permission has been granted by many publishers and individuals to re- 
produce illustrations or passages, and the author gratefully acknowledges 
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these permissions: American Journal of Psychology; the American Psy- 
chological Association for its several journals; Appleton-Century-Crofts, 
Inc.; Cambridge University Press; Ralph Gerbrands; Harvard University 
Press; Journal of General Psychology; J. B. Lippincott Company; Ronald 
Press Company; F. Loren Smith; Teachers College Contributions to Educa- 
tion; University of California Press; John Wiley & Sons, Inc. In each case 
the original source has been cited with the material reproduced. 


James Deese 
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CHAPTER 1 


INTRODUCTION 


'Most human behavior is learned. This fact in itself makes clear why the 
study of learning is one of the most important parts of psychology. It is im- 
possible to understand the behavior of human beings and that of most ani- 
mals without knowing something about the basic principles of learning. In 
order to have a thorough understanding of almost any problem in psychol- 
ogy, we should know what effect learning and different learning experiences 
have upon the particular problem we are studying. For example, if we were 
interested, as personality theorists are, in the effects of traumati c experi- 
gtice s during infancy or childhood upon adult personality, we should dis- 
cover that we could understand this problem only to the extent that we 
understood the basic principles of learning. If we knew as much about 
learning as we should like, we could answer questions about the extent to 
which traumatic experiences can affect adult life and the degree to which 
different segments of adult behavior may be affected. In short, it is not diffi- 
cult to prove the importance of learning in psychology. 

It is probably more difficult to convince the student that learning theory 
is of enough general interest and importance to warrant study. Theory, even 
to the fairly sophisticated student of psychology, often means fussy dis- 
putes about matters of no consequence. We must admit that there is a 
certain justice in this view. This should not blind us, however, to the impor- 
tance of theory, both in the understanding of learning itself and in its appli- 
cations to the study of personality and abnormal behavior. 

Theories serve several useful functions. They supply hypotheses about 
unsolved problems and serve to integrate and lend coherence to a scientific 
discipline. They help us to organize factual information and to understand 
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its various applications to other areas. It is usually more reasonable to 

formulate a hypothesis about personality development, for example, from 

a well-established theory of learning than from a simple factual rule. This is 

true because a theory frequenUy suggests why a particular application 

should be made, while the extension of a factual rule, more often than not, 
is simply a guess. 

There is no better way to appreciate the value of theory than to read a 
survey of the experimental literature about some aspect of learning. If we 
do this, we shall find hundreds of experiments which overlap, duplicate, 
and frequently contradict one another. With no theory to guide us, we 
should find it difficult to tell important from unimportant experiments. If 
we were to memorize the contents of such a survey, we should have at our 
command a mass of highly specific information limited to narrow examples 
of behavior. Most general principles are established through theory of some 
sort. Therefore a theory can be used to lift us beyond the confines of iso- 
lated experiments into an awareness of general principles about learning 
and behavior. 


Unfortunately there are very few theoretical principles in the psychology 
of learning that are both unequivocal and widely accepted, because very 
few existing theories are supported by a wide variety of evidence. We must 
therefore be cautious in the extent to which we apply theory. It is necessary 
to strike a balance between the simple factual information at our disposal 
and the general principles developed or invented to explain and extend it. 
Since complete devotion to a given theory is likely to be dangerous given 
our present psychological ignorance, we should be cautious in accepting 
a universal set of explanatory principles that can be applied to all problems 
in learning. Rather, we shall study some limited aspects of certain theories 
and their applications to specific problems. Some of the grand sweep of a 
really systematic theory is lost in this way, but, considering the limitations 
of our factual knowledge, this approach is more realistic. 


SOME METHODOLOGICAL BACKGROUND 

the^mvchJo evidence and theoretical principles of 

the psychology of learning, we need to establish some ground rules about 

finding evidence and discovering or inventing theoretical principles We 

meth^Zs tLv a?? T experiments and the logic of scientific 

as they apply to some of the problems we shall examine later. 
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Experiment and Observation 

A scientific observation occurs when one looks at an event and records it 
in such a way that any other competent observer, given the same conditions, 
would make the same report. An experiment, in the simplest sense, is an 
observation made under conditions that are, to some extent, of the ob- 
server's own devising. In an experiment the investigator can actually change 
and control the events that occur. Thus, watching squirrels retrieve buried 
nuts from the ground could be a scientific observation, but to turn this into 
an experiment, we should have to control some of the conditions of ob- 
servation (such as the place where the nuts are buried). 

Most of the factual information in the psychology of learning has been 
gathered in experiments, because, whenever possible, investigators prefer 
to experiment rather than simply to observe. This preference is well 
founded; it is always easier to infer causal relations when the events under 
study are controlled. The kind and extent of the control in the observation 
limit the adequacy and generality of the causal inferences the investigator 
may make. If only one condition is systematically varied, we can be a good 
deal more certain of the cause than if all conditions were allowed to vary 
haphazardly. Since we are usually interested in making causal inferences 
from our observations, it will be worthwhile to glance at some of the rules 
for conducting experiments. 

Independent and dependent variables. There is a particular way in 
which events are controlled in an experiment. In the simplest kind of ex- 
periment, all conditions are held constant but one, and this is allowed to 
vary in known ways. For example, we may want to study the effects of 
different kinds of rewards upon the speed with which monkeys can learn a 
new habit. To do this, we should systematically vary the nature of the 
reward given to different monkeys and try to keep constant in all monkeys 
the effects of motivation, previous history, testing conditions, etc. Since we 
had controlled all the variables but reward, we should be safe in making 
inferences from our results about the effects of quality of reward upon 
speed of learning. 

In this example, kind of reward is an independent variable because it is 
independently controlled and varied by the experimenter. The changes in 
behavior that result from changes in reward provide the dependent variable. 
Speed of learning, in this example, is the dependent variable because 
changes in it depend upon changes in the independent variable, reward. 
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Independent variables, then, are the conditions that are systematically 

varied by the experimenter; the dependent variables are the consequent 
changes in behavior. 


More than one independent or dependent variable. In the example just 
mentioned, all the variables but kind of reward were controlled, either by 
being held constant or by being allowed to vary randomly from animal to 
animal. In some experiments, however, we may use more than one inde- 
pendent variable and measure more than one dependent variable. 

Experiments with more than one independent variable are quite com- 
mon. Investigators frequently use this method because they suspect that the 
two independent variables somehow modify each other’s influence on the 
dependent variable. In these circumstances, we speak of the result as an 
interaction between the two variables. For example, reward and motivation 
interact m their influence on behavior. Under very low or under very high 
motivation, animals will frequently not discriminate among rewards of 
slightly different quality. But under moderate motivation — a mild degree of 
hunger, for example — animals will discriminate among qualitatively differ- 
ent rewards. Thus, it is easy to see that the effect of reward on behavior 
IS modified by the effect of motivation on behavior. 


Sometimes, too, we may use more than one dependent variable in an ex- 
periment. We may be interested in the influence of reward upon both speed 
of learning and general activity. In this case we obtain information that we 
should not get by making two separate experiments, one on speed of learn- 
ing and one on general activity level. The additional information is not 
quite m the same form as that achieved in the case of two independent 
variables; usually it comes in the form of a correlation between the two 
dependent variables. Speed of learning and general activity may be so highly 
correlated, for example, under the influence of different kinds of reward 
or motivation that we suspect there is some underlying general factor that 
accounts for the changes m both speed of learning and general activity 
Functional relationships. When the dependent variable shows change 
s a result of variations in the independent variable, it is said that the <L- 
pendent variable is a function of the independent variable. In the example 

Functional relationships are very important in the psychology^of learning 

since many of the results are quantitative and can sometimefbe expressed 
as simple mathematical functions ^ 
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because they show at a glance the nature of a functional relationship. In 
graphic presentation the independent variable is always placed on the 
horizontal axis (the abscissa) and the dependent variable on the vertical 
axis (the ordinate). A good example of such a graph is Figure 4 on page 21. 
Here the magnitude of a conditioned response (dependent variable) is 
plotted against the number of trials of training (independent variable). 


Definitions in the Psychology of Learning 

The term reward that we have been using is a common word in our 
vocabulary. It means something to everyone, but, unfortunately, nearly 
everyone interprets it in a slightly different way. If a term is to be useful 
to the experimental psychologist, it must be capable of exact definition; 
that is, it must refer eventually to a measurable fact — a concrete object or 
condition. Many words that refer to psychologically important events are 
also used in an everyday sense. The common definitions of these terms, even 
when explicitly set forth in a good dictionary, depend upon casual observa- 
tions, tradition, and the opinion of authorities. Of course, there can be as 


many definitions of this sort as there are dictionaries. 

Operational definitions. In order to make our psychological language 
as explicit as possible, we must redefine many familiar words as well as 
introduce some new words. Then we shall have to give these words clear 
operational meaning. An operational definition of a term, in the simplest 
case, IS a description of the operations performed by an experimenter or 
observer and the changes in behavior that result. Desire, reward, motives, 
etc., can all be given surprisingly precise definitions if described in this way. 
A hungry rat, according to the operational definition commonly agreed 
upon by psychologists, is one that has been deprived of food for a period 
of time after having been on an accustomed feeding schedule. 

By the same token, an operational definition of the term reward usually 
states that it is some change in the environment (presenting an animal with 
food, for example) that increases the likelihood of the animal repeating the 
behavior that immediately preceded the reward. Notice that there is no 
mention of something that the animal likes” or “something that it wants 
to get , rather, the definition is explicitly stated in terms of things that can 
always be observed and agreed upon. We may all disagree about what ani- 
uials like (unless we operationally define “like”), but most of us can 
agree that sometimes animals repeat what they have done before because 
It was followed by a reward. As a matter of fact, this is the fundamental 
aspect of voluntary learned behavior. 
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Hypothetical constructs. Of course, not all the terms used by psycholo- 
gists are given direct operational definitions. Someone, for example, may 
say that “learning is an increase in the excitatory potential for a given 
habit.” Excitatory potential in this definition is a hypothetical construct; 
it is not something that can be directly observed. Excitatory potential is a 
theoretical term invented by a psychologist to account for certain changes in 
behavior. Psychology abounds with these hypothetical constructs; as a 
matter of fact, the word learning itself is usually used in the sense of a 
hypothetical construct. Hypothetical constructs are useful ( 1 ) when they 
can be related to things that are operationally defined and (2) when they 
have wider application than an operational definition. 

The second requirement of a hypothetical construct calls for some ex- 
planation. Let us take the example of reward again. Reward is occasionally 
used as a hypothetical construct rather than an operationally defined term 
(though more usually the term reinforcement is substituted for reward). In 
such a case it may be defined by saying that it is a “slight decrement in the 
level of neural excitability.” If we define reward as a hypothetical construct, 
we must show that such a definition is useful. It would be useful if it could 
serve to relate our operational definition of a reward (in terms of change of 
behavior) to some other conditions. In this case we might argue that the 
hypothetical construct serves as a guess as to why the behavior changes. If 
this is so, we ought to be able to draw a number of implications from the 
hypothetical definition of reward and put them to experimental test. If the 
construct stands up under test, we might conclude that we had an “explana- 
tion” of the operational fact of the effects of reward on behavior. Thus, we 
see that hypothetical constructs are potentially very useful as explanations. 

Unfortunately it is sometimes very difficult to put hypothetical constructs 
to a test (the above example is particularly bad in this respect), and conse- 
quently psychology abounds in constructs that are potentially explanatory 
but are only really so if we are willing to accept a great many dubious as- 
sumptions. Nevertheless, we still find it necessary to make use of constructs 
in the following discussion, even though we know that they are not always 
well grounded in experimental fact. 

Some psychologists make a distinction between hypothetical constructs 
and intervening variables (MacCorquodale and Meehl, 1948). In this dis- 
tinction, hypothetical constructs are theoretical terms which cannot be 
entirely reduced to collections of operational definitions, while intervening 
variables are, in a sense, abstractions which can be. Intervening variables 
are terms that serve to relate a number of operationally defined independent 
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and dependent variables. One example of an intervening variable that we 
shall see much of in this book is strength of response. Strength of res|3onse 
is not a dependent variable, but it determines what will happen to de- 
pendent variables. Thus, if we suspected that quality of reward affected 
strength of response, we should have no way of measuring this effect 
directly. We could, however, measure the effect of quality of reward upon 
the likelihood of future occurrences of a response and upon the magnitude, 
or size, of future responses. We should expect both likelihood and magni- 
tude to increase with preferred rewards. We can then use the single term 
strength of response to cover both changes in probability of occurrence 
and size of responses. 

If we make a hypothetical construct out of tlie notion of strength of re- 
sponse, we add something to its definition that cannot be directly referred 
to a collection of observations. We might say. for example, that strength of 
response is ultimately defined in terms of the configuration of protein mole- 
cules on the nerve membrane. It is unlikely that we shall ever directly ob- 
serve such configurations of molecules, but such a definition could have 
important implications for our interpretation of experimental facts. 

This distinction between hypothetical constructs and intervening vari- 
ables may seem abstract and, indeed, it is. When dealing with the intrica- 
cies of psychological theory, however, it is sometimes helpful, and we shall 
have occasion to use it in many connections. While intervening variables are 
useful in limited psychological theory, whenever a theoretical psychologist 
tries to relate the theoretical terms of psychology to neurophysiological or 
physical events and theories, he makes use of hypothetical constructs. 

If we keep in mind the distinction between hypothetical constructs and 
intervening variables, as well as the distinction between these and opera- 
tional definitions, we shall find it much easier to keep fact and theory 
separate and to evaluate theory against available factual evidence. 

The remainder of this chapter discusses an important operational dis- 
tinction between two types of behavior and two types of learning. Since 
there is a question as to whether or not this distinction involves theoretical 
as well as operational matters, the importance of keeping operational and 
theoretical terms separate should be apparent at the outset. 



TWO TYPES OF LEARNING 


Learning to typewrite is different from learning to ski, if for no other 
reason than that different movements and muscles are involved. Learning to 
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typewrite and learning to ski, however, are both examples of the same 
processes of learning at work. In both cases, initial attempts are corrected 
in response to information the learner receives about their success. Most 
psychologists, however, are now convinced that not all learning involves 
the same basic processes, though they do not agree about how many proc- 
esses there are. Some psychologists have suggested several types; Tolman 
(1949), for example, suggested six different learning processes. While few 
theorists agree with this proposition, it is clear that one must make an opera- 
tional distinction between at least two types of learning when talking about 
the behavior of complicated organisms like dogs and men. Perhaps, if 
one is concerned with more primitive organisms, it is necessary to add even 
more types (Humphrey, 1933). At least two types of learning are neces- 
sary, however, and a great many investigators at different times have dis- 
tinguished between them.^ Since most of these distinctions have elements in 
common, it is apparent that many psychologists, when distinguishing two 
types of learning, have been driving at some basic principles. Let us ex- 
amine two cases of learning and see if we can abstract some of these 
principles. 

r 

T wo Examples of Learning 


Conditioned salivation in the dog. Every student of psychology is fa- 
miliar with the famous experiments on conditioning made by Ivan P. Pavlov, 
the Russian physiologist. Because Pavlov’s experiments are both well known 
and fundamental to the problem we are discussing, we can use one of thefn 
to illustrate one of our two types of learning. 

In this experiment (Pavlov, 1927), small openings were made in the 

cheeks of dogs, so that the duc t of the paro (jH gland could be 

directed outward; this allowed the saliva to be collected and measured as it 

was secreted. The dog was placed in a soundproofed room, which had a 

mall window that permitted the experimenters to watch the dog. A tuning 

fork was sounded in the room and, a few seconds later, small amounts of 

powdered meat were presented for the dog to eat. The tuning fork, of 

course, did not produce saliva, but the sight of meat and its presence in the 

mouth did After a few pairings of the tuning fork and the meat powder, 

the tuning fork was presented to the dog yvithout the meat powder. It was 

apparent that the tunmg fork had acquired the ability to elieit the salivary 
response whic h was mitially limited to meat powd er. 

Hn«ar. an. Mar.uis ( 1 . 40 ). 
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Pavlov referred to the meat powder as an ijn^;i^^tidLni)ned stimulus. An 
unconditioned stimulus is any stimulus that jias t he ability to elicit a re- 
sponse without prior training.^ Thus, when meal powder is placed in the 
mouth of a dog, a salivary reflex is immediately produced. This is called an 
unconditioned response. The tuning fork was referred to by Pavlov as a 
conditioned stimulus. Conditioned stimuli, in Pavlov's experiments, are 
those which initially do not elicit the response under study but which come 
to do so by being paired with the unconditioned stimulus. Likewise, the 
salivary response produced by the tuning fork is called a conditioned 
response because it is a learned response conditional upon the previous 
pairing of stimuli. 

This basic experiment of Pavlov’s has had an enormous influence upon 
the psychology of lea rning, and t)^^ ternns unc onditioned and conditioned 
stimuli and unconditioned an d conditioned responses are part of the basic 
vocabulary of the psychology of learning. Let us now look, however, at an 
example of learning that is quite different from the one studied by Pavlov. 

Conditioned bar pressing in the rat. Our second example is perhaps not 
quite so familiar, but it is equally important. This time we shall deal with 
the behavior of the laboratory rat, and as in the case of Pavlov’s experiment, 
this example will serve as a prototype for an important class of learning 
processes. 

A white rat is placed in a 12-inch-square box that is relatively sound- 
proof. At one end of this box is a small lever, or bar, that projects from the 
wall. This bar is connected to an automatic recording device and to a maga- 
zine filled with small pellets of rat food. When the bar is pressed, a pellet 
of food is dropped into the box from the magazine. The essentials of this 
device, usually called a Skinner box, can be seen in Figure 1 (Skinner, 
1938). 

If the rat is hungry when it is placed in the box, it will explore the box 
quite readily. It will sniff the air, paw the walls of the box, bite here and 
there. Eventually, because there is little else to do, the rat will manipulate 
the lever. Sooner or later it will depress the bar enough to release a pellet 
of food. The rat may not discover the food immediately, but when it does, 
it will certainly eat the food. It is also fairly certain that the rat will press 

^ The ability of the unconditioned stimulus to elicit a response may be because of 
learning that took place before the animal came to the laboratory. Operationally, how- 
ever, the “unconditioned” ability of a stimulus refers to the state of things at the be- 
ginning of an experiment. Obviously, if one wishes to state that a particular uncon- 
ditioned stimulus has an innate eliciting tendency, he must carefully control the life 
history of the animal. 
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the lever a second time. This time, perhaps, it will discover the pellet of 
food immediately. At this point the behavior of the rat will change dra- 
matically. Instead of resuming its casual exploration of the box, it will now 
proceed to press the lever repeatedly. As a matter of fact, if it is hungry 
enough, it will press the lever at a rate determined only by the time it takes 
to put its nose down and eat the food. 

It is apparent that the rat has learned or has been conditioned to press 
the bar. Is this example of learning like the one studied in Pavlov’s labora- 
tory? To answer this question, let us see if we can make use of Pavlov’s 



Figure 1. An example of the Skinner box. One side has been cut away to show the interior. When 

the animal depresses the lever, a device automatically delivers a pellet of food into the box. 
(Skinner, T938.) 

terms in describing this example. Our first problem arises when we try to 
identify the unconditioned stimulus. Now, it is clear that the unconditioned 
response must be pressing the bar during the course of exploration. What 
was the stimulus that elicited this response? We can guess, but we can never 
know with any certainty what the stimulus actually was. It might have been 
the sight of the bar or the tactual stimulation obtained by feeling along the 
wall. What, then, is the stunulus to which the bar pressing is conditioned 
(the conditioned stimulus)? In other words, what leads the rat to press the 
bar after it has “discovered the connection” between the lever and food? 
Again, we are reduced to a guess. 

All we do know is that we have a response (pressing the bar) that dra- 
matically changes its character after the rat has been rewarded a few times 


IMTiiiODUCTION ' ' 

for performing the response. In short, we have a response v/hich increases 
in likelihood as the result of reward. The more common psychological term 
for reward in this situation is reinforcement. We may diagram the sequence 

of events this way : 

Response Reinforcement Increase in strength 

(lever-pressing) ^ (food) of response 

The reinforcement produces the increase in the strength of the bar-pressing 
response. 

A comparison oj the two examples. There are two important differ- 
ences between the two exp>eriments we have just discussed. In the salivary- 
conditioning example, the response is directly under the control of the in- 
vestigator. Provided that the animal is in normal condition, the salivary 
response can be elicited simply by providing the appropriate stimulus. In 
the case of the lever-pressing example, however, the investigator cannot 
directly elicit lever pressing; rather, he must wait until it occurs spontane- 
ously. 

The second difference is in the action of the reinforcement. Pavlov used 
the terms reinforcement and unconditioned stimulus interchangeably. This 
usage has caused some confusion, though in a sense it is justified. The 
unconditioned stimulus is a reinforcer in Pavlov’s experiment because with- 
out it no conditioning would occur; it reinforces conditioning. The two 
terms, reinforcement and unconditioned stimulus, are not synonymous, 
however, for we have just learned that we cannot use the term uncondi- 
tioned stimulus to apply to the reinforcement in the bar-pressing example. 
The reinforcement in this case follows the response, and we do not know 
what the stimulus is that leads to the response. Thus another difference be- 
tween Pavlov’s experiment and the bar-pressing experiment is that in 
Pavlov’s experiment the unconditioned stimulus is also the reinforcement, 
whereas in the lever-pressing experiment no unconditioned stimulus can be 
identified, though a reinforcement which follows the response can usually 
be easily identified. In Pavlov’s experiment the animal is reinforced whether 
or not it gives a conditioned response, while in the bar-pressing example the 
animal is reinforced only when a response occurs. This is a very important 
difference. 

Principles Distinguishing the Two Types of Learning 

We have looked at example's of two types of learning. Let us now turn 
to the basic principles that distinguish these two types; we have already 
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been introduced to some of these principles by comparing the two examples 
with each other, but in this section we shall make the principles more ex- 
plicit and general. 

Operant versus respondent behavior. In a long series of papers and in 
his book The Behavior of Organisms (1938), Skinner has pointed out that 
we have not only two basic types of learning but also two basic kinds of 
responses. These two types of responses Skinner called r espondent s and 
operar^. Respondent^are actsj3r_b^avior di^ctly elicited by stimul i; they 
obey the classical physiological laws of reflexes, and, indeed^ the terms jcp- 
sp on dent and reflex are identical for all practical purposes. Opemnts, on the 
[ other hand, a^ acts or behavior for which th^re are jio readily, 

! Examples of operant behavior are the familiar ones of 

spontaneous activity in animals and men — monkeys pulling at bananas and 
infants crawling about the floor. Although we cannot identify a prior stim- 
ulus producing such behavior, we do not assume that the behavior is truly 
spontaneous in the sense of being without antecedents, but we recognize 
the fact that the stimuli producing these acts are beyond our immediate 
contriil. They may be under the control of some autonomously active center 
in the nervous system or under the control of some internal stimuli, such 
as those coming from the sense organs within the body, but in any event we 
usually cannot observe, much less produce, these stimuli. 

Skinner has pointed out that re^p^mdents are most characteristic of the 
autonomic n^rymiS system, while operants are characteristic of the skel^ al 
i\£iyous-syste 4 Ji. The correlation is not perfect, for the familiar knee jerk, 
which is skeletal, is a respondent. The correlation of these two types of 
behavior with different parts of the peripheral nervous system is an im- 
portant one, however, because of the role of the autonomic nervous system 

in emotion. As one might expect, respondents are especially important in 
emotional learning. 

Instrumental versus classical conditioning. We still have not examined 
the principles distinguishing two different ways of learning. The operant- 
respondent distinction generally refers to types of behavior, not types of 
learning. Hilgard and Marquis (1940) have most clearly distinguished be- 
tween two types of learning, which they call classical and instrumental 
conditioning. The essential element defining classical conditioning is that 
the reinforcement be independent of the occurrence of a conditioned re- 
sponse. It was because Pavlov’s experiments were of this type that Hilgard 
and Marquis characterized it as classical conditioning. The essential condi- 
tion defining instrumental conditioning is that the reinforcement be de- 
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pendent upon the occurrence of the conditioned response. This is the case 
in the bar-pressing example, and indeed in any case in which an animal is 
“trained to do a trick.” Most of our attempts to teach children and animals 
are of this type; we make a reward or reinforcement contingent upon the 
behavior we want the subject to learn. 

Two types of behavior and two types of learning. In the example^from 
Pavlov, the response was a respondent and the condit ioning classi^. In 
the example of the rat pressing, the lever, the response was an oj^eraut and 
the conditioning tpst rume n^. Do these correlations between respondent- 
classical conditioning and instrumental-operant conditioning always hold? 
Probably not, though in practice classical conditioning nearly always occurs 
with a respondent, and instrumental conditioning with an operant. There 
are, however, apparently mixed examples, and while we shall not deal with 
these at present, they will become important when we come to theoretical 
matters. 

There is considerable dispute as to whether the operational distinctions 
between tw'o types of behavior and between two types of learning are basic 
theoretical distinctions as well. Are these differences limited to our tech- 
niques for investigating behavior, or do they reflect some fundamental 
differences in the processes governing behavior and learning? There is a 
distinction of hypothetical constructs that corresponds to the operational 
distinctions we have been discussing, and cases which are borderline in 
operational distinctions become very important in deciding whether or not 
theoretical distinction should be made. These are problems, however, that 
we can defer for a while. 


CONDITIONS CONTROLLING SIMPLE LEARNING 


We have seen two examples of el^entary kinds of learning. Such ex- 
amples are usually called conditioning. There is no special way in which 
conditioning can be clearly differentiated from learning, but the term usually 
refers to cases of learning in which simple responses are acquired under 
simple or restricted conditions. Some investigators (Mowrer, 1947) would 
like to limit the term conditioning to the Pavlovian or classical variety, but 
such a restriction is difficult to maintain, since, as we shall see, some more 
complicated examples of learning involve elements of both classical and 
instrumental conditioning. Furthermore, the two types obey many of' the 
same laws. In this book, the word c onditionin g will be used to refer t o- iso- 
lat^ responses leaned under simple ponduioas winch preclude^pxevious-or 
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^ncurrent learning that would either help oj interfer^with the learning 
of the response^under study. These elementary examples are extremely 
interesting to the psychologist, because in them he can see at work basic 
processes that may be obscured in more complicated cases. 

Variables that determine the nature of conditioning. In the next few 

chapters, we shall look at some of the variables that determine the course 

of simple learning. These generally operate in more complicated examples 

of learning as well, but usually in ways that make them difficult to study. 

These conditions include such things as the kind and level of motivation, 

the relevance of the response and the reinforcement to that motivation, the 

scheduling of reinforcement (whether or not reinforcements are given for 

every response), time relationships between stimuli and between stimuli and 
responses, etc. 

Theories about conditioning. As we discuss the way in which these 
various conditions determine the course of conditioning, we shall also ex- 
amine the role theories play in interpreting or predicting the results of 
experiments using these variables. T ijeories make predictionSj based on tlie 
properties of hypothetical constructs, about the effects of such things ^s 
motivation upon learning and behavior. In so far as it is possible, we shall 
try to evaluate the success of various theories in making such predictions; 
in practice this is difficult to do, since theories are frequently not stated 
clearly enough to make unequivocal predictions about behavior. In one 
sense good theories are easy to prove wrong, since they make clear and 
unequivocal predictions that are relatively easy to test; poor theories some- 
times endure because they are so vaguely stated that it is impossible to 
prove them wrong. 


CHAPTER 2 


REINFORCEMENT AND LEARNING 


Reinforcement is one of the basic concepts in the psychology of learning. 
In practice it often means the same thing as the term reward. Reward is too 
general in meaning for our purposes, however; also, it is debatable whether 
its ordinary meaning applies to reinforcement in the case of classical condi- 
tioning. Therefore we shall not make much use of the term reward; instead 
we shall substitute the more exact term reinforcement. 

AN OPERATIONAL DEFINITION OF REINFORCEMENT 

Recall, for a moment, the example of instrumental conditioning given in 
the first chapter. In this example, a laboratory rat learned to press a bar, 
and this response was reinforced with a pellet of food. The food was clearly 
responsible for an increase in the rate at which the rat pressed the bar, for 
we know that without it the rat would only occasionally have pressed the 
bar. Having'eamed a few pellets of food in this way, however, it pressed 
the bar just about as rapidly as it could, pausing between responses only 
to eat the pellets. 

If, instead of following the bar-pressing response with a pellet of food, 
we had used the sound of a buzzer, would the rat have learned to press the 
bar? Probably not, our common experience with rewards would tell us 
(though, it should be remarked, sometimes our common experience can 
lead us to the wrong conclusions) . Here, however, our common sense would 
be right; in this situation food is a reinforcement and a buzzer is not. Food 
is reinforcing because it can produce an increase in the frequency of the 
response that it follows. Obviously, because the buzzer cannot produce 
such an increase, it is not a reinforcement. 

15 
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A definition of reinforcement. With this example in mind, we can now 
look at a formal definition of reinforcement. A reinforcement is any stimulus 
event that will increase or maintain the strength of a response or stimulus^ 
response connection associated with it. A reinforcement in the case of 
operant behavior is a stimulus that follows the occurrence of the response 
and rewards it; a reinforcement in respondent behavior is the stimulus that 
elicits the unconditioned reflex. 

The one feature of this definition that has not yet been made explicit is 
the phrase strength of response. Strength of response is properly an inter- 
vening variable (see Chapter 1), something that cannot be directly ob- 
served. Rather we infer the strength of a response from some aspect of 
the response that we can measure. Thus, in order to specify the formal 
definition of reinforcement completely, we need to say exactly what de- 
pendent variables can be used to indicate the strength of response. 

Strength of response. The most important indicator of strength of re- 
sponse is a measure ordinarily called probability of response. This is the 
likelihood that a response will occur in a given unit of time or when a par- 
ticular stimulus is presented. Thus if a response occurs frequently in a given 
unit of time, it is a strong one. For rats in the Skinner box, a strong tendency 
to press the lever would be indicated by a rate of 10 responses per minute, 
while a weak tendency would be 1 response per minute. 

Some investigators prefer to limit the measurement of strength of re- 
sponse to probability (Skinner, 1938, 1950); and still other investigators 
try to reduce all other possible indicators of strength of response to some 
form of probability of occurrence (Estes, 1950; Mueller, 1950). In prac- 
tice, however, many other indicators are used. Two of these lire reaction 
time or latency (Graham and Gagne, 1940) and amplitude or magnitude 
of response (Hovland, 1937). For the time being we shall not be too 

concerned with the problems created by recognizing several indicators of 
strength of response. 

Testing the effects of reinforcement. In the case of operant 'behavior 
and instrumental conditioning, the methods of assessing the effects of rein- 
forcement are quite simple. Usually the response before reinforcement is 
very low in strength; the probability of occurrence is very small. After a. 
few reinforcements, however, the probability of occurrence per unit time 
changes rapidly. Figure 2 shows this effect taking place. In Figure 2 'tl^ 
cumulative frequency of responses ig- plotted against time. Thus when th^ 
frequency of responses per unit time is low, as it is at the outset of the ex- 
periment, the responses cumulate very slowly; the result is a nearly hori- 
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zontal line with occasional bumps which show the occurrence of responses. 
When the response becomes conditioned, however, the response occurs 
much more frequently, so that the line goes up rapidly. The actual learning 
itself, in this example, is shown by the change in the slope of the curve. 

In classical conditioning the situation is usually more complicated. As 
explained in the first chapter, the reinforcing stimulus in the classical condi- 
tioning of a respondent is the unconditioned stimulus. Thus, the stimulus 
that elicits the response in the first place also provides the reinforcement. 
The conditioned or learned response, however, occurs to the conditioned 
stimulus, and because the conditioned stimulus usually precedes the un- 
conditioned stimulus, the conditioned response usually “anticipates” the 
unconditioned stimulus. This is not necessarily the case, however, for it is 



Figure 2. A cumulative curve of instrumental conditioning. All responses were reinforced. There 
is no evidence of learning following the first three responses, but the fourth is followed by a rapid 
change in rate of responding. (Skinner, 1938.) 


possible to condition a respondent when the unconditioned and conditioned 
stimuli are simultaneous. If they occur simultaneously, then we must use 
special “test” trials to find out whether conditioning has taken place. With- 
out these trials we should have no way of knowing whether a particular 
response was the result of the conditioned or of the unconditioned stimulus. 
If we make the two stimuli simultaneous and then test for a conditioned 
response by omitting the unconditioned stimulus on some trial, we can de- 
termine whether or not we have a conditioned response. According to the 
definition of reinforcement, however, prior association of the conditioned 
stimulus-response connection with the unconditioned stimulus or reinforce- 
ment is necessary, so that if we omit the unconditioned stimulus on too 
many successive test trials the conditioned response will disappear. Inci- 
dentally, in the example of salivary conditioning, the indicator of strength 
of response is usually one of magnitude of response — the amount of saliva 
given on any trial. 
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Trans-situational reinforcers. The definition of reinforcement given 
above is open to the charge of being circular — it is only a definition, not a 
law. There are laws about reinforcement, however, and one of them is that 
most reinforcers for operant behavior are trans-situational (Meehl, 1950). 
This means that most reinforcements will strengthen all or most leamable 
responses in a given species. In other words, if a particular stimulus 
event is reinforcing for a bar-pressing response in the rat, it ought to be 
reinforcing for other responses, such as running a simple maze. Turning out 
a light is reinforcing for the laboratory white rat. This reinforcement can 
be used to teach a rat to press a lever that turns out the light or to run a 
simple maze that has a dark box at the end. 

Respondents are trans-situational in the sense that one can use almost 
any conditioned stimulus for a given response, but because of the elicited 
character of the respondent, the unconditioned stimulus cannot be used as 
a reinforcer for more than a few closely related respondents. Thus re- 
spondents are not trans-situational so far as responses are concerned. 

Meehl (1950) proposes that all reinforcers (at least for operants) are 
trans-situational and that every increase in strength of response or learning 
involves the use of a trans-situational reinforcer. This is probably not ex- 
actly true, for later in this book there are examples of operant reinforcers 
that are specific to certain responses, but it is probably true that most of 
the reinforcers of everyday life are trans-situational. This means that one 
can use almost any reinforcement to teach almost any response; the success 
of the reinforcement for a particular response will not depend upon that 
particular response-reinforcement connection but upon the general ade- 
quacy of the reinforcement. If some stimulus is not reinforcing for bar 
pressing, it wifi not be reinforcing for learning a maze. 

CONDITIONS OF REINFORCEMENT 

Let us now look at some of the experiments that tell us about the condi- 
tions under which reinforcement occurs and how these conditions modify 
the effects of reinforcement. These experiments are important because they 
tell us how behavior is determined by reinforcement when other variables 
either are held constant or interact with reinforcement. Since reinforcement 
is such an essential mechanism in the behavior of organisms, one can 
ook upon this section as a fundamental introduction to the psychology of 
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and the other of which does not. These other examples would show char- 
acteristics much like those of the two cases we have cited. Response 
strength would receive its greatest increment with the first or one of the first 
reinforcements, and with successive reinforcements the increase in response 
strength would become progressively smaller. This uniformity in simple 
learning curves has led psychologists to believe that there is a fundamental 
learning process that can be mathematically expressed as a theoretical learn- 
ing curve. Let us glance at the properties psychologists have proposed for 
such a curve and the equation that governs it. 



Figure 4. Amplitude of the galvanic skin re»ponse as a function of the number of reinforcements. 
{Hovland, 1937a.) 

A theoretical curve. The exact shape of the fundamental learning curve 
—the one relating response strength to number of reinforcements— has 
occupied the interest of many theoretical psychologists. Investigators have 
tried to predict the shape of simple learning curves from mathematical 
assumptions and then to apply these predictions, in turn, to more compli- 
cated relationships in psychology. We shall now examine one of the most 

widely accepted of these attempts to discover the basic properties of the 
fundamental learning curve. 

Here are three of these basic properties that should be expressed in any 
mathematical or theoretical learning equation: 

1 . Strength of response is an increasing function of the number of rein- 
forcements or reinforced trials. 
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2. Strength of response increases only to some limit; beyond this there is 
no further increase unless the conditions of the experiment change. This limit 
or asymptote may vary with different situations and different species of 
animals, but with the same animals and the same situation, it is a constant. 

3. Each successive reinforcement adds smaller and smaller amounts of 
response strength. 

With these characteristics in mind let us consider how strength of re- 
sponse R grows with number of reinforcements N. With each increase in 
number of reinforcements, there is some increase in strength of response. 
We can put this in the form of a ratio, 

A/? 

AN 

where A means increase or increment. This expresses characteristic 1. 
Characteristic 2 says that strength of response can increase only to some 
limit. Let us call this limit M. Characteristic 3 suggests that as strength of 
response gets close to the value of M, the increases in strength of response 
grow smaller. Thus as the difference M — R becomes smaller, AR be- 
comes smaller. All of this can be put in the form of an equation: 

-EN^km-R) 

This equation says that the increment in strength of response is proportional 
to the difference between the maximum strength possible and the amount 
already accumulated. As this difference becomes zero, increase in response 
strength becomes zero — no further increase is possible.^ 

The above equation may be written as a differential equation and in- 
tegrated. This yields 

R = M ~ Me-^^ 

To those with little or no mathematics, this equation may not mean much. 

Suffice it to say that it describes learning curves of the sort presented earlier 
very well. 

This equation, or something very close to it, has frequently been a basic 
assumption in theories of learning (Hull, 1943; Estes, 1950; Estes and 
Burke, 1953; Bush and Mosteller, 1953). Sometimes it is elaborated to 
provide for the possibility that the first reinforcement may not always pro- 
duce the greatest increase in strength, that there may be a warm-up period, 

^ Some investigators have proposed quite different properties for basic learning 
curves. See, for example, Gulliksen (1934). 
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so that the largest increments come after the first few reinforcements. In 
this case the curve will be S-shaped. Despite the possibility of minor modi- 
fications, the ideas presented above tell us about the basic elements that go 
into theoretical learning curves. Sometimes these elements are described in 
terms of hypothetical neural events or samples of stimuli conditioned on 
each trial, but in any case the characteristics outlined remain the same. 

I. 

Amount and Type of Reinforcement and Strength of Response 

V 

The fundamental fact about learning is that behavior changes progres- 
sively as a function of the number of reinforcements. This relationship, 
however, is modified by a number of conditions of reinforcement, such as 
the amount and quality of the reinforcing agent. In the case of food rein- 
forcement, for example, a large amount of food results in a greater strength 
of response than a small amount (Grindley, 1929). In itself this is trivial, 
for it is limited to the kind of animals used in the experiment (chickens, in 
this case), the particular response, and the kind of reinforcement. Other 
experiments, however, give us grounds for believing that this relationship 
is generally true for simple conditioned operants and food reinforcement. 
Guttman (1953), for example, finds that it is true of the lever-pressing 
response in rats for various concentrations of sugar and that it is also true 
of peanuts used as reinforcement for problem solving in monkeys (Harlow 
and Meyer, 1952). 

Even if we use a large experimental literature to establish a general law 
about strength of response and amount of reinforcement, it seems rather 
trivial, if for no other reason than that it appears to be obvious. There is 
something about this relationship that is not quite sb obvious, however, 
and that is the way in which amount of reinforcement affects behavior. The 
exact way in which this happens involves an important distinction in the 
psychology of learning. 

The distinction between learning and performance. For some time psy- 
chologists have been aware that there could be a difference between what 
an animal learns and what it does. It is possible, for example, that a rat 
could learn that pressing the lever in the Skinner box leads to food but 
would not show that it had learned, perhaps because it was afraid of the 
noise the food-delivery apparatus makes. Or a rat may know the location 
of food in a maze, but because it is not hungry, it may not show any par- 
ticular disposition to go to the food. Thus, the fact that an animal does not 
perform very well in a learning test does not mean that the animal has not 
learned what it should. It is possible, however, to devise special tests that 
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make it possible to tell whether an animal has learned or not, even when its 
performance is poor. 

This distinction between learning and performance is important to the 
question of how amount of reinforcement affects behavior, for there is a 
good deal of evidence to suggest that amount of reinforcement affects per- 
formance but not necessarily learning.'^ 

Two investigators (Crespi, 1942, 1944; Zeaman, 1949) have attempted 
to separate the effects of amount of reinforcement on learning from the 
effects on performance by ( 1 ) training rats to run a simple runway, such as 
the one described earlier in this chapter, under various amounts of food 
reinforcement, and then (2) switching the amount of reinforcement when 
the rats had reached a stable level of performance. The results of these 
studies are well defined. The amount of reinforcement has no effect on the 
rate at which animals approach the final level of performance, but it does 
affect the final level they achieve. In other words, after learning is com- 
plete, animals that have received different amounts of reinforcement do 
perform at different levels, but they do not learn at different rates. Thus, the 
limit of response strength M is affected by the amount of reinforcement, 
but the rate at which this limit is reached, k, is not. Furthermore, if one 
trains a rat with one amount of reinforcement and then suddenly shifts to 
a higher amount, the change in performance is almost instantaneous — the 
rat does not have to learn to appreciate the new amount of reinforcement; 
it produces an instantaneous change in the rat’s behavior. 

Both of these studies used a very simple response. The only thing the 
rats had to do was to run down a runway to get food placed at the end. 
One investigator (Crespi, 1944) measured speed of running, and the other 
investigator (Zeaman, 1949) measured latent period. Other investigators 
who have studied more complicated examples of learning have reported that 
amount of reinforcement makes no difference at all. For example, in teach- 
ing rats to discriminate between a black goal box and a white goal box by 
placing food in one and not in the other, it was discovered that the amount 
of food makes no difference in either learning or performance (Reynolds, 
1949). In another example amount of food reinforcement made little differ- 
ence in performance in a complicated maze, except when no food at all 
was given (Furchtgott and Rubin, 1953). There is a good reason why these 
examples of more complicated behavior, when compared with the simple 
running response, show little or no effects of amount of reinforcement. In 
the runway case, the measures of behavior are more a reflection of per- 
formance than of learning. There is very little to learn on the runway. In 
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the more complicated examples, because the animals have relatively didicult 
problems to learn, the measures of behavior more nearly reflect learning. 
Because the behavior reflects more learning than performance, amount of 
reinforcement does not appear to change behavior very much. If no food 
at all is given, however, performance deteriorates badly. 

>^'here are several other studies in tlie experimental literature (Maher and 
Wickens, 1954; Reynolds, i950a, i950b) which support the conclusion 
that learning is usually independent of amount of reinforcement while per- 
formance may not be. There is, however, a fundamental problem that is 
raised by one of these studies. 

In this study the investigator tried training rats in a maze witli no rein- 
forcement at all. It would seem reasonable, by extending the principle that 
we have already stated, to say that the absence of reinforcement affects 
performance, not learning. This is fundamental because it suggests that rein- 
forcement is necessary for performance but not for learning itself. This 
problem is so basic and so controversial that it has not yet been fully settled. 
For years investigators have been trying to find ways to decide whether 
reinforcement affects only what an animal does or both what it does and 
what it learns. One of the ways in which they have tried to reach a decision 
in this matter is to compare learning under some reinforcement with learn- 
ing under no reinforcement. Such experiments are called latent-learning 
experiments. 

Latent learning. The latent-learning experiments grew out of the theo- 
ries of E. C. Tolman (1932, etc.). One of Tolman’s principal ideas was 
that reinforcement — or reward — affected performance but had little or 
nothing to do with learning. The latent-learning experiments were designed 
to test this idea, and one of the earliest was made by Blodgett ( 1 929 ) 

Blodgett allowed one group of hungry rats to explore a maze that did 
not have food in the goal box. This group of rats, therefore, was not rein- 
forced, argued Blodgett (though this argument is suspect, as we shall see). 
A second group of rats explored a maze with food in the goal box, and 
therefore it received the conventional reinforcement. 


There was a difference between the behavior of these two groups. The 
group which was reinforced with food seemed to learn on schedule, but the 
unreinforced rats showed little improvement. After seven days, the rats in 
this group were entering almost as many blind alleys as they had on the first 
day. At this time, however, Blodgett put food into the goal box for these 
rats. Their performance suddenly and dramatically improved, so that almost 
immediately they made as few errors as did the previously reinforced rats. 
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These results can be seen in Figure 5. They show us that the rats which 
did not receive the food as reinforcement were able to profit from their ex- 
perience in the maze, though the results of such experience did not show up 
until a reward was provided. Because the learning that apparently took 
place during the unreinforced trials was not evident until food was intro- 
duced, the learning was said to be “latent.” 

Many investigators have repeated this or similar experiments (Thistle- 
thwaite, 1951), but not all investigators have been able to find what 
Blodgett found. A careful repetition of Blodgett’s study (Reynolds, 1945b), 



Figure 5. Number of errors during reinforced and nonreinforced trials in maze learning. When 
reinforcement is introduced, errors drop to o level comparable to that of animals continuously rein- 
forced. (Ooto from Blodgeif, 1929.) 

for example, showed that the absence of food in the goal box does not 

necessarily mean the absence of reinforcement, for in this repetition animals 

that were not reinforced with food nevertheless showed improvement during 

the latent period, although they showed less than animals reinforced with 

food. Actually, as shown in Figure 5, Blodgett’s animals learned a little as 

they explored the maze for seven trials without food. A later experiment 

(MacCorquodale and Meehl, 1951) revealed some of the things that were 

reinforcing in the absence of food, such as escape from situations that were 
anxiety-provoking for the rats. 

It IS clear that Blodgett’s experiment is rather like the studies on amount 
of reinforcement. The group that did not receive food was weakly reinforced 
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and did not improve very much until given a strong reinforcement; then 
improvement occurred on one trial. Thus, Blodgett's study suggests, as do 
the studies on amount of reinforcement, that reinforcement affects per- 
formance, but not rate of learning. 

Two more experiments on latent learning deserve mention. In these some 
of the critical weaknesses of Blodgett’s study are removed. In one (Buxton, 
1940), rats were permitted to live in a maze for a period of several days. 
They were placed in the maze at different positions and removed from it 
at different positions. Consequently, the animals explored all parts of the 
maze to about the same extent, and they were not differentially rewarded by 
being removed from any particular part of the maze. 

The important result of this experiment is that it was not necessary to 
give the rats one rewarded trial before the sudden improvement in per- 
formance occurred. Instead of allowing the rats to run through the maze to 
discover the food the first lime, as Blodgett had done, the experimenter put 
them directly into the goal box, so that they had a chance to see the food. 
When they were put at the beginning of the maze, most of the rats were 
able to find their way to the goal box without making many errors. 

A second experiment (Seward, 1949) confirms this result, and these two 
experiments taken together show that it is possible for rats to learn the 
general plan of a simple maze when they are not differentially reinforced 
in any particular part of the maze. Though again, until the food is intro- 
duced as a reinforcement, performance does not show what the rat has 
learned. 

Time of Reinforcement ^ 

Another important variable in the instrumental learning of operant be- 
havior is the time between the response and the reinforcement. This is not 
important in classical conditioning because there is no well-defined way of 
varying the time between the conditioned response and the reinforcement, 
though we may vary the time between the conditioned stimulus and the 
reinforcement (unconditioned stimulus). We shall examine this problem 

later. For the present we shall restrict ourselves to examples of operant 
behavior. 

In an experiment by Perin (1943) we can see the basic nature of the rela- 
tionship between time of reinforcement and strength of response. Perin 
used a Skinner box and rats. He slightly modified the usual procedure in 
the Skinner box so that a movement of the lever either to the right or left 
would produce a pellet of food. After the animals had learned to move 
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the rod in both directions and had displayed their preference for one direc- 
tion, the apparatus was changed so that a movement of the rod in the 
preferred direction would give no food. A movement of the rod in the other 
direction, however, would cause the food pellet to be delivered. 

The most important factor in this experiment is that the time between 
response and reinforcement was varied, ranging between 0 and 30 seconds 
for different rats. As illustrated in Figure 6, the results were clear. The 
sooner the reinforcement was delivered, the higher the rate of learning. As 



Figure 6. Slope of the learning curve as a function of time between response and reinforcement. 
The curve is extrapolated, on the basis of the five plotted points, to show zero learning at a delay 
between 30 and 45 seconds. {Data from Perin, 1942.) 

a matter of fact, the rats with longer time delays learned so slowly that the 
experimenter predicted they would not leam at all if the time between 
response and the reinforcement were extended to about 45 seconds. 

While Perin s experiment is a simple illustration of the basic relation- 
ship between time of reinforcement and response strength, it raises im- 
portant questions. For one thing, there is a marked difference in the length 
of gradient achieved by Perin and that found by other investigators 
(see Hull, 1943). The results of the Perin study lead us to predict that if the 
delay had been 45 seconds, the rats would not have learned to press the lever 
at all. Other studies, however, show that under proper conditions the 
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delay between a response and a reinforcement can be extended to approxi- 
mately 20 minutes and rats will still learn (Wolfe, 1934). W'e shall deal 
with this problem shortly. 

The other problem created by this experiment is indicated by the label 
“Slope of learning curve/' on the ordinate of the graph in Figure 6. Now 
this plotting suggests that time of reinforcement, rather than amount of re- 
inforcement, affects both the limit and the rate of learning, and, indeed, this 
is the conclusion of an important theorist (Hull, 1951 ). Let us look at the 
implications of this finding. 

Time of reinforcement and learning versus performance experi- 
ments subsequent to Perin's show that the situation is complicated. Logan 
(1952) tried an approach similar to that used in the learning versus per- 
formance problem pertaining to amount of reinforcement; he trained rats 
to a steady level of performance under two different delays of reinforce- 
ment and then switched these delays. Initially, performance was poorer for 
a long delay than a short one. Animals that had been trained on a long delay 
gradually improved in performance when switched to a short delay, while 
the converse was true of those switched in the opposite direction. The fact 
that the change was gradual and not immediate suggests strongly that delay 
of reinforcement affects both learning and performance. A later experiment 
by Seward and Weldon (1953) shows essentially the same result. 

These findings pose a problem since most theories find it difficult to ac- 
cept the fact that amount of reinforcement affects only performance while 
delay of reinforcement affects both learning and performance. No simple 
explanation of this is possible, but let us look at a somewhat rudimentary 
version of one possible answer (Seward and Weldon, 1953). 

One of the things animals have to learn in psychological laboratory tests 
is the relationship between what they do and the presence of the reinforce- 
ment. Put anthropomorphically, sometimes it seems as if the animals have 
a different notion about what they are supposed to be doing than the experi- 
menters do. Skinner (1948) has pointed out that animals sometimes per- 
form responses which are not necessary to obtain the reinforcement; this 
Skinner called “su perstitiou s" behavior in animals. For example, a rat may 
appear to have the idea that it is necessary to turn around once counter- 
clockwise before pressing the lever. From Skinner’s demonstration it ap- 
pears likely that these false correlations between behavior and the reinforce- 
ment occur most frequently when reinforcement is delayed. In other words 
there is more uncertainty about which response is being reinforced when 
there is a delay between a particular response (such as lever pressing) and 
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the reinforcement. Since the reinforcement is sometimes connected by the 
animal to incidental behavior, the learned dependency between reinforce- 
me^ and the response that actually produces it is weaker. 

implication of this explanation is that we should reconsider the. t 
relationship between amount of reinforcement and learning. It is all right to 
say that any amount of reinforcement will have about the same effect on 
learning as any other amount, but any response the animal learns will prob- 
ably be weaker without reinforcement than it would be if it were reinforced. 
To put it anthropomorphically again, an animal might learn many things 
because they have equally interesting consequences, but because the animal 
learns these things more or less simultaneously, no one of them will be very 
strong. While it is clear that animals can learn in the absence of a con- 
sistent reinforcement, the learning may be weaker than it would be if such 
reinforcement were present. This seems to have been the basis for Tolman’s 
statement that reinforcers are necessary to performance but serve only as 
emphasizers for learning (Tolman, 1938). If the situation is simple, it 
is likely that there will be no difference between reinforced and unrein- 
forced learning, but in more complicated situations reinforcement may play 
a role in determining what animals learn. 

Let us now turn to the other principal problem inherent in delay of rein- 
forcement. Under some conditions reinforcements are effective after a long 
delay, while under others they lose their efficacy after a brief delay. Hull 
(1943) showed that the answer to this problem involved an important 

principle in the psychology of learning, the principle of secondary rein- 
forcement. 

Secondary reinforcement>y^ Secondary reinforcement applies to both 
classical and instrumental conditioning, so let us look at examples of second- 
ary reinforcement in both types of learning. 

In classical conditioning we can draw on an experiment of Pavlov’s 
(1927). Here Pavlov used the ticking sound of a metronome as the condi- 
tioned stimulus and food as the unconditioned stimulus for salivation. After 
a few pairmgs of the metronome with the food, the salivary response be- 
came conditioned to the metronome. So much, of course, is only a typical 
conditioning experiment. Pavlov, however, carried this experiment one 
step further. He presented a black cardboard square to the dog. As we 
might expect, no saliva flowed to this stimulus. The black cardboard square 
was then paired with the ticking metronome ten times. By the tenth trial, 
the dog gave a salivary response to the black square, though it had 
never been paired with food, the unconditioned stimulus. This response, 
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learned without the original uncomditioned stimulus, Pavlov called a higher- 
order conditioned response, but today we generally speak of this as second- 
ary reinforcement. 

t An experiment on maze learning provides an example of secondary 
reinforcement in instrumental behavior (Saltzman, 1949). In this experi- 
ment rats were trained to run down a straight runway to a goal box that 
contained food. The goal box was either black or w'hite. In one condition, 
the goal box always contained food when it was black but did not when it 
was white. After this training, the rats were taught a simple maze in which 
they had to choose between tw'o pathw'ays, one leading to a black goal box 
and the other to a w'hite one. The rats learned to go to the black goal box, 
even though it was never baited with food. Thus it is clear that the black 
goal box itself was reinforcing because it had been paired with a primary 
reinforcement, food, in another situation. The black goal box was thus a 
secondary reinforcement. 

A general definition of secondary reinforcement is that it is any stimulus 
that acquires reinforcing power because it is paired with a primary or un- 
learned reinforcement. It is easy to see that the concept of secondary rein- 
forcement is important, for it accounts for learning in which there seems 
to be no reinforcement based upon a known biological drive. However, we 
may ask, what does secondary reinforcement have to do with the time be- 
tween a response and its reinforcement? Since this is an important question, 
let us examine it in detail. 

Time of reinforcement and secondary reinforcement. The principle of 
secondary reinforcement enables us to describe why delaying reinforcement 
only sometimes has a deleterious effect upon learning. We mentioned earlier 
an experiment on lever pressing in which learning practically disappeared 
if the reinforcement was delayed much more than half a minute, while in 
another experiment on maze learning the reinforcement could be delayed 
for almost half an hour and the rats would still learn. Let us compare 
the two. 

In the maze learning experiment (Wolfe, 1934) the animals had to learn 
to go to one arm of a T-shaped maze. The goal box at the end of this arm 
contained food, and the identical goal box in the other arm did not. The 
maze had doors just before the goal boxes, so tha. the animals could be 
retained in the arms of the T before they were allowed to enter the goal 
box. Thus the arms of the T served as retention chambers. 

It is the retention chamber that explains why delay did so little harm to 
learning in this situation. Since the retention chamber in one arm always 
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immediately preceded food, it came to be a specific secondary reinforce- 
ment. Thus, no matter how long the animals were delayed before they 
entered the goal box, they could leam to turn to the side with food — simply 
because of the secondary reinforcement associated with this particular 
retention chamber. 

In the lever-pressing experiment (Perin, 1943), the experimenter went 
to a good deal of trouble to eliminate all secondary reinforcement. He built 
the apparatus so that the rod was removed immediately after the rat had 
pressed it; this stimulus, which might have become a secondary reinforce- 
ment, was therefore never present when the rat actually got the food (except 
when the delay between response and reinforcement was zero). Thus there 
was minimal opportunity for the lever to become a secondary reinforcement. 
As a result, the amount of delay between the primary reinforcement and the 
response had to be short if the animal was to leam, for there was little or 
no secondary reinforcement to bridge the gap. 

The gradient of reinforcement. The relation between response strength 
and delay of reinforcement has been called the gradient of reinforcement. 
This refers to the fact that the more immediate the reinforcement, the 
stronger the response. We have seen that if no opportunity for secondary 
reinforcement is present, and if there is any delay, however short, between 
response and reinforcement, the animal will not show that it has learned. 

Spence ( 1947) has made the suggestion that any delay between response 
and reinforcement is mediated by secondary reinforcement. That is to say, 
when all cues for secondary reinforcement are eliminated, there may be no 
learning unless the response and the primary reinforcement occur simulta- 
neously. This is probably something of an exaggeration. We have already 
seen that reinforcement seems to exert a primary influence on performance 
and only a secondary one on learning. So it is that animals will sometimes 
learn with no apparent reinforcement, though they will not perform well. 
Nevertheless, Spence’s point is well taken. Most of the delay between a rein- 
forcement and a particular response is probably bridged by secondary re- 
inforcement. A variety of experiments have shown that it is possible to 
change the extent of the gradient of reinforcement by changing the possible 
secondary reinforcements (Perkins, 1947; Grice, 1948; Smith, 1951). 

THEORIES OF REINFORCEMENT 

We have had little to say thus fat about theories of reinforcement, and 
yet for many years these have been of major concern to psychologists inter- 
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ested in learning. Although it is certainly true that the question of why 
reinforcement occurs is important, we have deferred discussion of this 
problem because many basic issues are still unsettled. On several important 
matters we cannot even state any very firm conclusions. Some questions, 
however, have been settled by experimental work, and theorists have come 
to some agreement on still others. Let us look at the basic issues and see 
how some of them have been tentatively settled, while others have remained 

controversial. 


Basic Issues in Theories of Reinforcement 

In the following section we shall examine some of the more important 
issues involved in the various theories of reinforcement and the attempts 
that have been made to resolve them. Two problems are of paramount im- 
portance. One is the question of why certain stimuli appear to be rein- 
forcing and others do not. The other is whether reinforcement determines 
learning or performance. In addition there is a subsidiary problem of 
whether or not the same principles of reinforcement apply to both re- 
spondent and operant behavior. 

Why are sortie stitnuli reinforcing? It is fairly clear that there are only a 
limited number of stimuli that are reinforcing to every animal. Animal 
trainers know this, of course, and they are careful to reward the behavior 
they want to teach their animals with the proper reinforcement. If we are 
teaching tricks to dogs, a simple pat on the head will reinforce most of the 
time, though food is probably better. With the domestic cat, a pat on the 
head is usually not enough. Indeed, it is possible that the difficulty people 
have in teaching cats may be due in part to a more limited class of rein- 
forcing stimuli. 

Although it does not sound very enlightening to say that reinforcing 
stimuli are things that animals like, this common-sense definition contains 
the germ of some of our most important theoretical ideas about reinforce- 
ment. 

The origins of present interpretations of why some stimuli reinforce ani- 
mals and others do not probably come from the theory of natural selection. 
Darwin and his successors tried to explain the process of species evolution 
in terms of natural selective breeding. The adaptive characteristics of or- 
ganisms survive, said Darwin, and the unadaptive ones die out. Naturalists 
have pointed out many examples of anatomical and physiological character- 
istics that are adaptive and that seem to be perpetuated through selective 
breeding. Thus in evolution, biological changes in organisms have “good” 
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and bad effects, and, according to the natural selection theory, only the 
changes with “good” effects survive. 

It was apparent to many early workers in evolution that behavioral pat- 
terns also provide good examples of the survival of adaptive mechanisms. 
Furthermore, in the lifetime of an individual there appears to be a process 
of selective adaptation. Organisms seem to learn those things which are 
useful they learn the location of food supplies, water, hiding places, etc. 
The early literature of comparative psychology abounds with anecdotes 
illustrating the survival value of the things animals learn. 

Thus, the argument goes, adaptive behavior (that which preserves the 
animal from harm and keeps it fed and sheltered) survives and becomes 
learned, whereas unadaptive behavior does not. This concept was combined 
with the hedonistic ideas of Utilitarianism. Pleasure, it was held, is asso- 
ciated with adaptive behavior and pain with nonadaptive behavior. Thus 
pleasure and pain were brought into an adaptive theory. 

This kind of notion — a behavioral hedonism combined with the sur- 
vival of adaptive traits — has had an enormous influence on the theory of 
learning and behavior. It is hard to find theorists who deny that reinforcing 
stimuli are connected with adaptive mechanisms in the animal. Many 
theorists deny the universal importance of adaptive reinforcement, how- 
ever, and others say that, while it determines what the animal will do, it 

has nothing to do with what the animal learns. Thus we face once more this 
issue of learning versus performance. 

Reinforcement in learning and performance. The question of whether 
reinforcement determines learning or merely performance is a relatively 
new one. Most of the older investigators did not make a distinction. E. L. 
Thorndike, one of the first experimental psychologists to explore the adap- 
tive characteristics of reinforced behavior, never clearly stated whether he 
thought that reinforcement affected performance or both learning and per- 
formance. In his Principle of Effect (1898), which led to our current opera- 
tional notions of reinforcement, he said that rewards tend to stamp in 
behavior. This and other of his statements imply that reinforcement acts 

upon both what the animal learns and what it does — indeed that reinforce- 
ment is necessary for learning. 

This implication from Thorndike had considerable influence, and for a 
long time the dominant view was that an adaptive reinforcement was neces- 
sary for both learning and performance. Hull (1943) was the most influ- 
ential theorist to adopt this viewpoint. He held that adaptive reinforcement 


35 


REINFORCEMENT AND LEARNING 

was necessary for the learning of both operant and respondent behavior. 
Hull was a quantitative-learning theorist and wrote nearly all his hypotheses 
as equations. It is interesting to note that near the end of his career Hull 
drastically revised his theory by rewriting his equations so that performance 
but not learning was determined by reinforcement (Hull, 1951). 

The need-reduction theory of reinforcement.' Hull's original belief in 
the importance of reinforcement for both learning and performance led him 
to speculate on the mechanism of reinforcement, and his conjectures have 
provided a specific theory that is valuable whether reinforcement affects 
learning and performance or just performance. This is the need-reduction 

theory of reinforcement. 

The need-reduction theory states that reinforcements are always related 
to specific motive states in animals and men. They reduce the level of or- 
ganic or secondary need derived by a process of learning from some organic 
need. Thus food is reinforcing for a hungry animal, just as water is for a 
thirsty one. Other drives, sex, for example, are also reduced by appropriate 
reinforcement. Most experimental work, how'ever, tends to deal with hunger 
and thirst since both the level of drive and its appropriate reinforcement 

are easier to control. 

There is little doubt that drives and their reduction control the strength 
of responses. Whether all behavior is directly or indirectly controlled by 
need reduction is impossible to answer, but there is experimental evidence 
of the control exerted by factors such as reduction of hunger. An experi- 
ment by Miller and Kessen (1952), for example, shows that milk intro- 
duced directly into the stomach of rats by way of a fistula can serve as a 
reward in maze learning. The rats in this experiment could not taste the 
milk, so it was obvious that the milk was not reinforcing because of its taste 
or because of “consummatory activity”; furthermore, a control experiment 
in which saline was used instead of milk made it clear that the critical factor 
is not just something in the stomach but rather the nutritive value of food. 

Not all behavior is controlled by primary reinforcement, according to this 
theory. For, of course, secondary reinforcements can be built upon primary 
reinforcement. Secondary reinforcements, moreover, involve secondary 
drives (Miller and Dollard, 1941). It is uncertain whether or not secondary 
reinforcements reduce secondary drives as primary reinforcements do 
primary drives. In the case of fear (a secondary drive) it is thought that 
escape from fear (a secondary reinforcement) does result in the reduction 
of the secondary drive, but this seems to be a somewhat special case, so we 
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cannot make any general statement from it. It is quite possible, however, 
that secondary reinforcements do involve some reduction in a need anal- 
ogous to the reduction of a primary drive by a primary reinforcement. 

In recent years investigators have found evidence for a particular system 
in the brain that, when stimulated, reinforces behavior (Olds and Milner, 
1954). Since such stimulation does not reduce a biological need, it is argued 
that need reduction itself is not necessary for reinforcement. In the usual 
course of things, however, need reduction may activate this reward system 
in the brain (Olds, 1955). Although need reduction is what normally dic- 
tates animal behavior, it is not the essential element. Perhaps activation of 
a particular part of the brain is the important factor. 

Furthermore, tjiere is reason to believe that there is a direct sensory, 

or hedonic, kind of reinforcement (Young and Shuford, 1954). That is to 

say, some things are reinforcing because they have the right kind of sensory 

consequences, not because they satisfy some biological need. Thus saccharin 

is reinforcing for rats (Sheffield and Roby, 1950) even though it has no 

nutritive value. Presumably it is reinforcing only because it affects the right 
taste receptors. 

Thus, while the original need-reduction theory seems rather naive now, 

it is only because we have a better understanding of the possible ways in 

which reinforcement controls behavior. Need-reducing reinforcements affect 

behavior to some extent, but they are only part of the story. Furthermore, 

along with Hull, most theorists have come to the view that need reduction 

and other possible sources of reinforcement, such as sensory effect, control 

what animals do but not what they learn. They may affect complex learning 

in one way or another, but reinforcing mechanisms are not essential to the 
learning process. 

An association theory of learning. We must then ask, Why does learn- 
ing take place? The answer is very old; in its simplest form it extends back 
to Aristotle. We can say that the only condition that seems basic to learning 
is that two things become associated. To make this statement intelligible, 
however, we have to state what kind of things become associated and under 
what conditions. On this point, there are several different views. 

In contemporary psychological theory one of the most common ideas 
IS that stimuli and responses are the associated factors. Stimuli come to elicit 
responses that they previously had not elicited because, perhaps quite by ac- 
cident, these stimuli occur simultaneously with certain responses. The model 
for this process is the classical conditioned response. Here we elicit an un- 
conditioned response by using an appropriate unconditioned stimulus at 
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the same time that we present a conditioned stimulus. Because the con- 
ditioned stimulus is simultaneous with the response, it comes, through 
association, to elicit that response. 

This association theory can be applied directly to classical conditioned 
responses. It also applies to instrumental conditioned responses, even 
though we cannot always produce experimentally stimuli that are simulta- 
neous with the response to be conditioned. Suppose, lor example, we are 
trying to teach a dog to retrieve something when we say, ‘‘Go get it, boy.” 
We can get dogs to retrieve easily enough, but it is much more difficult 


to get them to retrieve only on a given signal. Let us see how this kind of 
association theory says the process occurs. 


The best-known advocate of this kind ot association theory has been 

E. R. Guthrie, who has this to say about association: "A combination ol 

stimuli which has accompanied a movement will on its recurrence tend to 

be followed by that movement” (Guthrie, 1935, 1952). For Guthrie the 

classical conditioned response is the model tor all learning processes. In 

most cases, however, we cannot observe conditioned stimuli (for example, 

the rat pressing the bar in the Skinner box); and so in Guthrie’s theory 

they take on the character of hypothetical constructs. That is to say, we 

invent some conditioned stimuli for a particular set of learned responses. 

In the case of lever pressing, the conditioned stimuli are kinesthetic, accord- 
• 

mg to Guthrie ( 1952) . In other words, the rat makes a scries of movements, 
these stimulate the sense organs of the muscles, and this combination of 
muscular sensations serves as a conditioned stimulus. 

Thus, according to this particular version of association theory, if we 
wish to teach someone an instrumental response, we try to arrange it so 
that the stimulus and the response occur together. Obviously with such 
a principle of learning the distinction between classical and instrumental 
conditioned responses becomes purely a matter of convenience. Guthrie 
(1952) believes that this distinction may even be a bit pernicious, since 
it tends to make us accept instrumental behavior as purely “spontaneous” 
and prevents us from looking for stimuli which might cause the behavior. 

Strangely enough, however, Guthrie’s major experimental study is on 
instrumental learning. Guthrie is dissatisfied with the typical experiment 
on instrumental behavior, because, he claims, it examines the results of 
behavior, not the behavior itself. Take the lever-pressing case, for example. 
In the Skinner box we find out how many times per minute the rat presses 
the lever; ordinarily we are not interested in the way in which the lever is 
manipulated by the animal. Guthrie objects to this approach, for he believes 
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that it is the association of particular movements of the animal with stimuli 
that is really important and that the result of the movement is only second- 
ary. Consequently in an experimental situation, Guthrie and Horton (1946) 
studied the way in which cats learned to escape from a puzzle box. Instead 
of merely recording the time it took the cats to get out of the box or the 
number of times they escaped, Guthrie and Horton observed the cats in 
action and took motion pictures. One of the things that impressed them 
was the degree of behavioral stereotypy. In many cases the cats’ move- 
ments were almost identical from trial to trial. This indicated to Guthrie 

that there was a close association between the cues and the movements 
involved. 

One of the implications of Guthrie’s notion of association of stimulus 
and response through contiguous exposure is that it takes only one pairing 
for conditioning to occur. This seems to go against everything we have 
said thus far, since curves show that learning is a gradual process. Guthrie 
has a good answer for this: An ordinary instrumental act (such as lever 
pressing or escaping from a puzzle box) involves many different move- 
ments. Thus it may take many trials for the animal to perfect a pattern of 
movements that is eventually successful in performing the act. Most be- 
havior involves the association of many stimulus-response units. We are 
constantly learning and unlearning these associations, so that there is a 
continuous fluctuation in our behavior. Therefore it takes time to perfect 
a series of movements that accomplish a particular result. 

This last point of Guthrie’s has been elaborated into an important 
mathematical theory of learning by Estes and Burke (Estes, 1950; Estes 
and Burke, 1953). In this theory the behavioral variability from trial to 
trial is attributed to the effective stimulus. On each trial the animal is 
stimulated with a sample of the available stimuli. The samples are partially 
overlapping on each successive trial, but the response that occurs on any 
one trial is that which is conditioned to 100 per cent of the sample stimuli 
on the particular trial. The fact that there will be sampling variability from 
trial to trial in the effective stimuli means that new unconditioned elements 
as well as old conditioned ones will be present each time. 

For example, all the stimuli in the Skinner box constitute a population 
of stimuli; at any one moment, however, the rat is stimulated only by a 
hample of these. Sometimes these samples will overlap a great deal, and 
occasionally not at all. From the laws of probability one can generate a 
learning curve for lever pressing based on the frequency with which suc- 
cessive stimulus samples will contain common elements. 
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We shall have occasion to examine some of the mathematical properties 
of this theory of Estes and Burke later. At present we should note that it 
suggests, as does its parent theory by Guthrie, that all conditioning is essen- 
tially classical and that the distinction between classical and operant con- 
ditioning is only an experimental convenience, not a basic difference. Both 
of these theories assign all conditioning to the classical variety because they 
suggest that the basis of learning is the establishment of a correlation be- 
tween some novel stimulus and a response it has not hitherto evoked. By 
confirming this new stimulus-response connection enough times we can 
make sure that the elements of the stimulus population and the components 
of the response are sufficiently associated to produce efficient performance. 
We must refer to this association as a correlation of stimulus and response, 
because in instrumental behavior we cannot always cause the association 
to be present; we are only capable of devising a situation in which there 
will be a correlation between a stimulus situation and a “spontaneous” 
response. Thus, this particular variety of association theory says that all 
learning is exemplified by classical conditioning, and only our lack of con- 
trol over unconditioned stimuli for instrumental or operant behavior pre- 
vents us from doing experiments on instrumental behavior that are exactly 
parallel to those on respondent behavior. 

In this kind of association theory, therefore, the function of reinforce- 
ment is simply the mechanical one of providing an end to the stimulus- 
response sequence. Reward serves to prevent the animal from unlearning 
what it has already learned by keeping it from reacting in other ways to 
the stimuli that lead to the response. Food at the end of a maze keeps the 
animal there instead of allowing it to wander through the maze unlearning 
proper turns and learning new turns to some blind alley. Thus, the rat eventu- 
ally does learn the maze. Reward only serves to keep the rat from unlearning 
what it has learned. Incidentally, this suggests that the rat would learn the 
maze if we simply removed it from the goal box at the end of the trial 
instead of giving it food. Seward (1942) tested this notion with the Skinner 
box — that is, he simply removed the animals from the box after they pressed 
the lever — and they did learn, though not so well as food-rewarded animals. 

It is difficult to see how the stimulus-response association theory could 
predict performance in latent-learning cases without some additional prin- 
ciples, for latent-learning experiments show that animals learn more than 
they perform during the latent phase. This particular association theory, 
as we have seen, puts its greatest emphasis upon performance— the move- 
ments made by the animal. 
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Another association theory. The topic of latent learning brings us to 
another association theory, one that is radically different. As a matter of 
fact, it is probably unfair to call it an association theory at all. It has one 
thing in common with the association theory that we have been talking 
about, though. The idea is that need-reducing reinforcement has little to do 
with learning, but that learning is a matter of contiguity. In this case, how- 
ever, the contiguity is not necessarily between stimulus and response. 

The foremost proponent of this view, that is, of an associative theory 
that does not use the classical conditioned response as its model, is E. C. 
Tolman. In a series of publications,^ he has developed and expanded what 
he calls a “purposive” theory of learning. This theory emphasizes “sign 
learning” as opposed to response learning. This means that animals usually 
learn (when running a maze, for example) the suc(^ssion of stimuli that 
lead to the goal. In one of Tolman’s characteristically epigrammatical 
phrases, the rats make “cognitive maps” of the maze (1948). Because 
animals learn relationships between stimuli rather than responses, he was 
able to predict the sudden shifts in performance obtained in complicated 
latent-learning experiments. 

Although there are many difficulties in Tolman’s theory, it has led to 
some interesting experiments. For the present we shall be content to men- 
tion it merely as a theory of learning that does not emphasize the need 
for reinforcement to “stamp in” learned responses. Reinforcement may help 
learned responses but learning itself can take place without it. 

Perhaps the most important contribution of Tolman has been his distinc- 
tion between learning and performance. He made this distinction early, and 
since then, perhaps because of the latent-learning experiments, most theo- 
rists have adopted it. Hull, for example, accepted it in his book written in 
1943, though he still defended the position that reinforcement was neces- 
sary for learning. In his last publications (1951, 1952), however, Hull 
accepted Tolman’s distinction in principle by admitting that learning was 
a function of neither reinforcement nor drive. 

The current status of theories of learning and reinforcement. There is 
no doubt that we can characterize most instrumental behavior as reinforce- 
ment dependent. If we look at most examples of laboratory behavior, we 
see that there is usually some special event necessary to produce and main- 
tain the behavior in strength. This can be giving a pellet of food to a hungry 
rat or allowing a monkey to glimpse a toy train in operation (Butler, 1954). 

2 Many of Tolman’s papers have been brought together in a collection (see Tolman, 
1951). 
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One of the arts in the control of behavior is knowing the range of reinforce- 
ments possible for a given animal and how to administer them for optimal 
results. Classical conditioning, of course, depends upon the reinforcing 
property of the unconditioned stimulus. As does instrumental behavior, a 
classical conditioned response tends to disappear if it is not occasionally 
reinforced with an unconditioned stimulus (see Chapter 3). There is no 


doubt about the importance of the reinforcement operation in the produc- 
tion and control of behavior. 

There is now strong evidence, however, that reinforcement is not neces- 
sary for learning, although it does probably modify learning. Thus rein- 
forced learning in a complicated problem, such as learning a long maze, is 
likely to be different from unreinforced learning. This is not because rein- 
forcement is necessary to learning, but because reinforcement indirectly 
modifies what animals learn by determining what they do. 


Thus, although we described earlier how' learning increases as a function 
of number of reinforcements, we should now modify this to see how learn- 


ing increases as a function of the number of exposures or experiences in 
the learning situation. Usually we reinforce successive tries, so that an in- 
crease in the number of reinforcements is correlated with the change in 
behavior. The essential requirement for a series of changes in behavior is 
simply that the animal behave in a certain way on successive exposures to 
the situation. All that is necessary to maze learning is that the animal run 
to the goal box a certain number of times. We can usually accomplish this 
most efficiently by reinforcing the animal. Thus the important variable in 

learning is the number of trials which are given an animal on a particular 
problem. 

The distinction between classical and instrumental conditioning. We 
have seen in Guthrie’s stimulus-response association theory that the distinc- 
tion between classical and instrumental conditioning is a matter of con- 
venience. Other theorists, however, maintain that there is a fundamental 
difference between them. Mowrer (1947), for example, points to the fact 
that the reinforcement in classical conditioning is not need-reducing, while 
such reinforcement is usually necessary to produce instrumental or operant 
behavior. Other investigators have pointed out that classical conditioning 
IS characteristic of simple reflexes, and instrumental conditioning of volun- 
tary behavior. Certainly, at the present time, we cannot come to a definite 
conclusion as to whether the distinction between classical and instrumental 
conditioning is real or merely convenient. This problem will come up in 
later chapters, and we shall see that it is of great current interest. 
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TEMPORAL ASSOCIATIVE FACTORS IN CONDITIONING 

Since reinforcement is not essential to learning, the most fundamental 
conditions we can describe are associations — associations between stimulus 
and stimulus, stimulus and response, and response and response. Associa- 
tion means that two events, stimuli or responses, must occur close together 
in time. We have already examined the data on response-reinforcement 
associations, but not the stimulus-stimulus relationships in classical condi- 
tioning. In this section we shall look at temporal factors in classical con- 
ditioning. These are of basic importance because classical conditioning is 
the simplest example of associative learning, 

Pavlov's studies on time relations in conditioning. Pavlov (1927) used 
two different techniques in the study of time relations in classical condition- 
ing. In one, the conditioned and unconditioned stimuli overlapped each 
other in time. The conditioned stimulus came either before the uncondi- 
tioned stimulus or the two were simultaneous, but the essential condition 
was that they overlapped. In the other technique, the two stimuli did not 
overlap. Usually the conditioned stimulus was presented first, and then, 
after an interval, the unconditioned stimulus. 

Pavlov thought that simultaneous presentation of the conditioned and 
unconditioned stimuli produced the most rapid conditioning, and in all the 
experiments in his laboratory, they were presented in this way. Following 
this, he gradually lengthened the interval between the conditioned and un- 
conditioned stimuli, or allowed longer exposures of the conditioned stimulus 
before he presented the unconditioned stimulus. 

When the unconditioned stimulus was delayed, Pavlov found that animals 
could learn to delay the onset of the conditioned response for long periods 
of time. They would delay giving a conditioned response until just before 
the unconditioned stimulus. Pavlov took this to mean that the conditioned 
response was actively inhibited. He used the delayed conditioned response 
as an example of what he caUed “internal inhibition.” He also found that 
animals could delay the onset of the conditioned response when the condi- 
tioned stimulus was presented and then withdrawn before the unconditioned 
stimulus came on. This Pavlov called the “trace” conditioned response, and 
he considered it also to be an example of internal inhibition. He founds 
moreover, that it was more difficult to establish the trace conditioned re- 
sponse than the delayed conditioned response. 
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These observations are important in the present context because they 
indicate associations between conditioned stimuli and conditioned responses 
to be extraordinarily flexible. Such associations do not have to be based on 
simultaneity or, indeed, upon close temporal association. Pavlov could 
introduce delays of many minutes in the trace conditioning procedure and 
still produce a conditioned response; the conditioned stimulus would come 
on briefly and then, after a w'ait, the conditioned response would occur. For 
the production of such delays it is apparently essential that nothing else 
occur between the conditioned stimulus and its response. Pavlov's experi- 
ments w'ere performed in sound-deadened rooms in order to control the 
stimuli impinging on the animal, so that the animal would not be stimulated 
in any way between the occurrence of the conditioned stimulus and the 
time for the conditioned response. Such an absolutely controlled environ- 
ment seems to be necessary for very long delays. The animals themselves 
came to inhibit activity in the time interval, and in some instances appeared 
to go to sleep. Thus, as appears to be the case with delay between responses 
and reinforcement in instrumental conditioning, if nothing interferes with 
the association the delay can be bridged. 

Rate of learning and delay of the unconditioned stimulus, A number of 
investigators have studied the rate of conditioning as a function of the time 
interval between the conditioned and unconditioned stimuli. In these studies 
the highest rate of conditioning occurs when the onset of the conditioned 
stimulus just precedes that of the unconditioned stimulus (Spooner and Kel- 
logg, 1947; Kimble, 1947; Moeller, 1954).^ These studies were all per- 
formed with human subjects; the conditioned responses were variously a 
hand movement, the blinking of the eyelid, and a galvanic skin response. 
Despite these differences, the studies all agree that the highest rate of con- 
ditioning occurs when the conditioned stimulus precedes the unconditioned 
stimulus by approximately half a second. Figure 7 shows the data on the 
conditioned hand movement, together with some data from earlier studies 
of the same response (Wolfle, 1930, 1932). 

One curious case of the time relation between the conditioned and un- 
conditioned stimuli is shown in Figure 7. This is backward conditioning. In 
backward conditioning the unconditioned stimulus precedes the conditioned 
stimulus. Pavlov thought that conditioning could not take place in this case; 
however, the data presented in Figure 7 show some evidence for backward 

3 The experiment by Kimble (1947) is not strictly an example of classical condition- 
ing, though it is sufficiently close to be considered so for present purposes. 
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conditioning. It is probably not true associative conditioning, however, but 
more likely a sensitization of the whole response system. The unconditioned 
stimulus (an electric shock in the cases in which backward conditioning 
is found) sensitizes the subject so that he will give a response to almost 
any stimulus. This interpretation agrees with the data of Spooner and Kel- 
logg (1947), They find that the number of backward conditioned responses 
is high early in training but decreases during it. This is because the subjects 
readily adapt to stimuli other than the shock. 



Figure 7. Results of the Spooner and Kellogg (1947) ond the Wolfle (1930, 1931) studies on the 
time interval between conditioned and unconditioned stimuli. The curve labeled ^'present study" is 
om the data of Spooner and Kellogg. Notice that the maximum frequency of conditioned responses 
occurs when the wnditioned stimulus precedes the unconditioned stimulus by a fraction of o second. 
Backward conditioned responses rarely occur. 

Rate of conditioning and association. All studies show that conditioning 
is less efficient when the stimuli are presented together than when the con- 
ditioned stimulus precedes the unconditioned stimulus by a brief interval. 
This fact is fundamental to the interpretation of the associative processes. 
The association is not always between the stimuli, but may be between one 
stimulus and the trace, or aftereffects, of another. Just why asynchrony is 
best is not clearly understood at present. The brevity of the optimal time 
interval between the onset of the conditioned stimulus and unconditioned 
stimulus suggests that this effect has to do with the time of events in the 
central nervous system. The important fact, however, is that the rate of 
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conditioning drops rapidly as the time interval between the stimuli is in- 
creased to even a few seconds (Moeller, 1954), In trace conditioning this 
interval may be extended, but only through tedious training under condi- 
tions in which there is a minimum of external stimulation. Thus while the 
duration of the stimulus trace is probably long, it is easily obscured by other 
stimuli, and it is therefore difficult to obtain the kind of associations char- 
acteristic of classical conditioning except with very short time intervals. 



CHAPTER 3 


EXTINCTION AND 
PARTIAL REINFORCEMENT 


When learned acts are no longer reinforced, they usually diminish in 
strength. Frequently, for example, colicky babies learn to cry in order to 
obtain extra attention. If this happens, parents are sometimes advised to 
avoid picking up the baby when it cries so that it will stop crying for atten- 
tion. It is doubtful whether many parents last out the process, but the advice 

Itself is testimony to the common knowledge of the effects of withdrawal of 
reinforcement. 

The effects of removing reinforcement from a particular bit of behavior 
are quite complicated. The general effect — diminished strength of response 
caused by a failure of reinforcement — is called extinction, and in the first 
part of this chapter we shall look at the way in which this is modified by 
different variables. In later sections we shall examine the more general prob- 
lem of the effects of intermittently given reinforcement — we shall see what 
happens when a given response is reinforced some, but not all, of the time. 

extinction 

In the simplest examples of learning, a response is reinforced every time 
it occurs. Thus, each time the rat in the Skinner box presses the lever, it 
receives a pellet of food. After the rat has learned about the correlation be- 
tween food and pressing the lever, it wUl continue to press the lever at a 
steady rate as long as it remains hungry. If the reinforcement is discon- 
tinued, the rat will continue to respond for a while, but at a gradually 
dimmishing rate and with increasing irregularity. Figure 8 shows how with- 
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drawal of reinforcement affects rate of responding in the Skinner box. From 
this figure it is clear that the rate of responding is high at the beginning 
(right after reinforcement is withdrawn) and that it gradually diminishes, 
so that by the end of the first hour without reinforcement, the rat is going 
long periods of time without giving a response. It is also clear that the rate 
of responding is quite irregular during extinction (compared with the highly 
regular rate under reinforcement shown in Figure 2). The rat may press 
the lever rapidly for a few minutes and then cease to press it altogether for 
a period of time. 



Figure 8. An extinction curve for the lever response in the Skinner box. The cumulative number of 
responses is plotted on the ordinate. The rate of responding is high at the beginning^ but then it 
begins to decline and become more Irregular. Long periods of time pass without responding, until, 
finally, the animal olmost completely stops responding. (Skinner, 7938, dofo from F. S. Keller and 
A. Kerr,) 

In the same way, the classical conditioned response gradually grows 
weaker if the unconditioned stimulus is removed. Thus, in Pavlov’s experi- 
ment presented in the first chapter, if the tuning fork is sounded a number 
of times without being paired with the meat powder, the salivary response 
to the tuning fork gets smaller and finally disappears. 

Resistance to extinction. It is apparent from the two examples we have 
just looked at that reinforcement creates a reserve of responses that can be 
emitted by the animal after the reinforcement is withdrawn. This reserve, 
or potential for responding in the absence of reinforcement, is usually called 
resistance to extinction or, sometimes, the reflex reserve (Skinner, 1938). 

Resistance to extinction depends, to some extent, upon the number of 
reinforcements given an animal before extinction is begun. Figure 9 shows 
that if a small number of reinforcements is given, few bar-pressing responses 
are emitted in extinction before animals cease responding altogether. As 
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the number of reinforced responses is increased, however, the number of 

responses in extinction also increases. But the number of responses increases 

only up to a limit. Thus, for low numbers of reinforcements, resistance to 

extinction depends upon the number of reinforcements that have been given 

before extinction is begun. Maximal resistance to extinction, however, is 

obtained with relatively few reinforcements, and after a certain point one 

can go on giving reinforcements without appreciably increasing resistance 
to extinction (Skinner, 1938). 



Time and Effort in Extinction 

Two variables of considerable theoretical importance in extinction are 
(1) the distribution of responses in time and (2) the effort required to 
make responses during extinction. These are linked here because together 
they provide information important to the theory of extinction. Let us first 
look at the problem of the distribution of responses. 

Distribution of responses in time. One of the main advantages of giving 
discrete trials during an experiment on learning is that the experimenter 
can control the rate at which animals respond. In the Skinner box, as it 
is usually used, the rate of performance depends upon the animal and such 
indirect means of control as hunger. Many experimental examples of instru- 
mental learning, including bar pressing, can be so arranged that discrete 
trials are given after the fashion of the classical conditioned response. For ex- 
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ample, it is a simple matter to withdraw the bar from the Skinner box and 
to present it again only when we wish to begin the next trial. 

Control over the rate at which responses occur is important to students 
of learning because Pavlov (1927) reported that a classical conditioned 
response extinguishes much more rapidly when trials occur close together 
than when they are spread apart. More recently experiments have been 
performed to see if this is also true of instrumental conditioning. 

First of all, it seems clear that performance of instrumental behavior 

during reinforcement is impaired when trials are massed together (Hovland, 

1936; Spence and Norris, 1950). Secondly, most studies show that massing 

trials during extinction has the effect of decreasing resistance to extinction 

or of decreasing the tendency to respond on any given trial during extinction 

(Gagne, 1941; Reynolds, 1945a; Rohrer, 1947; Teichner, 1952). Other 

studies, however, show that under some conditions resistance to extinction 

can be greater with massed trials (Sheffield, 1950) and that under still other 

conditions the spacing of trials probably makes no difference (Porter, 
1939). 

Teichner (1952) points out the probable reason for these differences. 
He suggests that, in general, acquisition is slower when trials are massed 
closely together and that extinction is faster under these conditions, but that 
these effects are relatively small and easily obscured by other conditions. 
For example, Teichner’s experimental results on a running response in rats 
show that the effect of spacing trials during extinction depends upon their 
spacing during acquisition. The greatest resistance to extinction is found 
when the same spacing of trials is used for both; when different spacing is 
used, resistance to extinction decreases. Since Stanley ( 1952) shows experi- 
mental evidence for this same point, we can conclude that maximal resist- 
ance to extinction is produced by using the same distribution of trials in 
both extinction and conditioning and that slightly greater reduction in resist- 
ance to extinction occurs if the distribution of trials in extinction is faster, 
rather than slower, than that in conditioning. 

Effort and extinction. The role that effort plays in extinction is impor- 
tant for theoretical reasons. One of the major theories of extinction, in fact, 

has assigned the decrement in response to a fatiguelike inhibition created by 
continued responding. 

A well-known experiment by Mowrer and Jones (1943) provides evi- 
dence suggesting that effort does influence resistance to extinction. These 
investigators trained rats to press a lever for food in the Skinner box. During 
acquisition the lever was counterweighted in different amounts, so that the 
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rats learned to press the bar when it required 5 grams to activate the feeder, 
when it required 42.5 grams, and when it required 80 grams. At the end 
of acquisition the rats were divided into three groups, and each group was 
extinguished with a different counterweight attached to the lever. There 
was a simple negative relationship between resistance to extinction and the 
amount of effort required to push the bar. This general effect is confirmed 
in a study of Solomon’s (1948) in which the distance rats had to jump to 
reach a goal box was the variable producing effort in the response. 

These two experiments leave something to be desired, however. It is 
possible that the differences in resistance to extinction were due to varia- 
tions in the amount of training given the different responses. When careful 
controls are introduced into an experiment on effort, resistance to extinction 
appears to vary only a small amount for responses requiring different 
amounts of work (Maatsch, Adelman, and Denny, 1954). Furthermore, 
the differences that do occur may be the result of differences in operant 
level of responding for various conditions; that is, rats may “spontaneously” 
engage in easy responses more frequently than in hard ones. 

Thus, while it seems likely that work and time are variables in determin- 
ing the course of extinction, they are not the most important ones. The 
results achieved in well-controlled experiments suggest that other factors 
are much more significant in determining the behavior of individual animals. 

As we shall see presently, this conclusion has important implications for 
theories of extinction. 

Spontaneous recovery and disinhibition. Pavlov, in his studies of con- 
ditioned salivation, discovered two phenomena that he thought to be of 
considerable theoretical importance. They are called spontaneous recovery 
and disinhibition. Again, we link them together because they are important 
in the theory of extinction. In both spontaneous recovery and disinhibition 
the major effect is a recovery of the conditioned response after a certain 
amount of extinction. Thus, a strong argument can be made that extinction 
generates an active inhibition of the conditioned response and that this in- 
hibition can be removed by particular conditions. Let us examine sponta- 
neous recovery rather closely, since it is the more important and pervasive 
of the two phenomena. 

Spontaneous recovery. An example from Pavlov’s laboratory will illus- 
trate the basic principle of spontaneous recovery. Pavlov (1927) produced 
a conditioned salivary response by allowing a dog first to see some meat 
powder and then to eat it. After the dog had learned to salivate at the sight 
of the meat powder, Pavlov extinguished the response by no longer putting 
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the meat powder into the animal’s mouth. It took only a few trials of this 
sort to extinguish the conditioned response. T he animal was then removed 
from the experimental room for approximately two hours. When the dog 
was brought back and allowed to look at the meat powder again, the con- 
ditioned response returned. True, the amount of saliva flow was not so great 
this time as it had been before the extinction process was begun, but there 
was no doubt that the conditioned response had recovered to a considerable 
extent — actually about one-sixth of the original amount. 

Another experiment (Ellson, 1938) illustrates the occurrence of sponta- 
neous recovery in an instrumental conditioned response. Four groups of rats 
were trained to press a lever in the Skinner box. Following the conditioning 
procedure, all animals were permitted to operate the lever without rein- 
forcement until they had failed to respond for a period of live minutes. One 
group of rats was then removed from the box and allowed to rest for five 
minutes. These rats were replaced in the box and additional unreinforced 
responses were counted. The second group of rats resumed extinction after 
a pause of 25 minutes, the third group after 65 minutes, and the fourth 
after 185 minutes. All animals showed an immediate tendency to press the 
lever when replaced in the box, even though all of them had failed to press 
the lever for a period of five minutes prior to their removal from the box. 
Figure 10 shows that the amount of recovery, or the number of responses 
after rest, is an increasing function of the length of the rest period. Like 
most effects associated with the learning process, however, this function 
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Figure 10. Number of lever responses in a second extinction period following various intervals of 
rest. In this study most spontaneous recovery occurs within an hour after original extinction. (Data 
from Bt/son, 1938.) 
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seems to have its limit, so that an extremely long rest period yields no more 

spontaneous recovery than a moderately long one. In this experiment the 

amount of recovery reached its limit at about one-fifth the original strength 
of the response. 

Spontaneous recovery is a genuine and stable part of the extinction process. 
We may, for example, try to stop our dog from begging at the dining-room 
table by ignoring it when it does so, and we may find by the end of the meal 
that we have been successful— the dog has retired to the living room. Never- 
theless, the next evening it will probably beg as before. Since spontaneous 

, * ) ) SO persistent this time 

and will stop sooner. Repeated extinction periods will finally eliminate the 
response. 

Disinhibition. The term disinhibition stems from Pavlov’s theory of 

extinction, and he placed great stress on the importance of this effect. In his 

original demonstration of disinhibition, Pavlov established a conditioned 

response to the sight of meat powder. Once it was well established, he 

extinguished it by showing the meat to the dog without letting the animal 

eat It. When the conditioned response was completely extinguished, Pavlov 

paired a new stimulus with the conditioned stimulus. This stimulus was not 

one that could elicit the conditioned response in itself, but when it was 

paired with the extinguished conditioned stimulus it did. Thus, the salivary 

response to the sight of food, which had been completely extinguished, came 

back again when the new stimulus, vibration, was presented with the con- 
ditioned stimulus. 


THEORIES OF EXTINCTION 

Animals cease responding when they are no longer rewarded. So simple 
a fact wouW hardly seem to demand very subtle theoretical interpretation; 
yet extinction has been one of the aspects of behavior involving the most 
complicated theorizing. Actually, of course, a theory of extinction is fun- 
damental to any theory of behavior, for the fact that animals cease to per- 
form unreinforced acts is just as important as the fact that they learn rein- 
forced acts in the first place. 

In the following section we shall study a few of the more important 
theories of extinction. Then we shall try to evaluate the current status of 
the theoretical questions and examine some of their weaknesses. 

Response-produced inhibition theory. The response-produced inhibi- 
tion theory is now the classical theory of extinction. Like “classical?’ theories 
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in many other fields, however, it has been thoroughly demolished by critics 
and the rubble used to build new theories. Although the response-produced 
inhibition theory is no longer acceptable in its entirety, two of its ideas are 
still of considerable importance. For this reason, and also because it pro- 
vides a good case history of the fate of psychological theories, we shall 
look at it in some detail. 

The most complete statement of the response-produced inhibition theory 
comes from Hull (1943), though he took the basic concept of inhibition 
from Pavlov and borrowed other ideas from a variety of sources. 

Reactive inhibition. Hull started with the assumption that whenever an 
organism makes a response it also generates some inhibition to that re- 
sponse. This means that, all other things being equal, once a response takes 
place, it is somewhat less likely to occur in the immediate future. Such in- 
hibition may be thought of as analogous to fatigue in that it makes the next 
response more difficult. Like fatigue, this reactive inhibition disappears 
after a period of rest; if there is not enough rest between responses, how- 
ever, the inhibition accumulates from response to response. 

Hull (1943) advanced the hypothesis that the amount of inhibition that 
results from a series of responses is a positively accelerated function of the 
amount of work the response involves.^ In other words, as the response 
requires more effort, the inhibition accumulates at an increasing rate. 
Further, he specified, rest reduces the inhibition generated by each response 
m a way that is best described by a simple negative exponential function. 
Eventually, with enough rest, the response wiU completely recover its 

strength. 

In this theory of Hull’s, reactive inhibition will accumulate whether the 
response is reinforced or not. If the response is reinforced, the positive 
effects of reinforcement must overcome the negative effects of reactive in- 
hibition. Most of the time the positive effects wUl keep the animal respond- 
ing, but there is an interesting exception— a matter of experimental fact— 
which lends plausibility to Hull’s notion. If, during conditioning, trials are 
massed very close together, the animals may slow their rates of responding 
and even stop altogether, even though the reinforcement is stiff present 

(Hovland, 1936; Calvin, et al., 1956). This effect has been called “inhibi- 
^ tion of reinforcement.” 


geste” ^ considerably revised the substance of this assumption; he sug- 

gested that for responses involving large amounts of work (relative to the ability of 

mall Tm ‘I ‘han that fl 
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The most important aspect of the response-induced inhibition theory, 
however, concerns the results of extinction. When responses are no longer 
reinforced, reactive inhibition is thought to accumulate without being coun- 
teracted by any positive effects of reinforcement. The result is that the 
animal eventually ceases to respond. Since reactive inhibition disappears 
with time, we should predict spontaneous recovery of the response after a 
rest. Remember, however, that in the example from Pavlov’s laboratory, 
spontaneous recovery was only about one-sixth the original response strength. 
At best, spontaneous recovery regains approximately 50 per cent of the 
response strength at the end of conditioning. Since spontaneous recovery is 
incomplete, Hull postulated a second factor, conditioned inhibition, to ac- 
count for the permanent effects of extinction. 

Conditioned inhibition. Hull regarded reactive inhibition as a negative 
motivational state. Presumably, animals are motivated to avoid reactive in- 
hibition as much as they are motivated to avoid an electric shock. If we 
accept the need-reduction theory of reinforcement, it is easy to see how 
escape from, or avoidance of, reactive inhibition would constitute a rein- 
forcement. Thus, when an animal ceases to respond, its fatigue state, or 
reactive inhibition, is reduced. This, in keeping with the need-reduction 
notion, is reinforcing. Reduction of reactive inhibition supposedly reinforces 
the animal for doing nothing, and the resulting failure to respond is called 
conditioned inhibition. 

The notion of conditioned inhibition does not entirely depend upon the 
need-reduction theory of reinforcement, even though Hull presents the idea 
in this context. Reactive inhibition could bring about failure of response. 
Simply by contiguity, failing to respond could become conditioned to the 
cues that previously brought about the response. Thus, learning not to re- 
spond could as easily be the result of simple association as of reinforcement 
by need reduction. 

How reactive and conditioned inhibition work in extinction. The theory 
of extinction based on reactive and conditioned inhibition seems compli- 
cated, but its essentials are simple. Extinction is the result of an active inhi- 
bition of a learned response. This inhibition is composed of two parts — 
reactive inhibition, which recovers with rest, and conditioned inhibition, 
which does not. Spontaneous recovery of an extinguished conditioned re- 
sponse is predicted from the assumption that reactive inhibition disappears 
with rest, while the permanent part of extinction is assumed to be the result 
of conditioned inhibition. The theory predicts more rapid extinction when 
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trials are massed during extinction and when a greater amount of effort is 
required in performing the conditioned response. 

As we shall see, this two-factor theory of extinction (as it has been 
called) has been applied to a variety of problems in behavior— for example, 
to the fact that rats generally prefer not to repeat the same response on 
successive trials in a maze. To take another important instance, it has been 
applied to the pronounced effects of massed practice and recovery with rest 
on the performance of complicated human skills, such as learning to type. 
Despite the widespread application of the theory, however, we now suspect 
that it contains some serious defects, and it is therefore important that we 
look at some of the criticisms of the theory. Since attempts to examine this 
theory experimentally have led to some valuable new information about 

behavior, the next section will serve as more than just a critical analysis of 
the response-produced inhibition theory. 

A critical analysis of the response-produced inhibition theory. The re- 
sponse-produced inhibition theory, as stated by Hull, has been criticized on 
three grounds: (1) the theory is a good deal more ambiguous than it first 
appears (Koch, 1954); (2) it leads to contradictory deductions (Gleitman, 

Nachmias, and Neisser, 1954); and (3) the empirical evidence for it is 
weak, and that against it, strong. 

Let us first look at the empirical evidence against the theory. This is per- 
haps the least critical objection to the theory, since almost any theory can 
be modified to fit a new fact. The facts stated below, however, are particu- 
larly awkward for the response-induced inhibition theory. 

Extinction without responding. The most important single idea in the 
response-produced inhibition theory is that decrement in responding is the 
result of a negative feedback from responding itself. Thus, the more effort 
a given response requires, the greater the difficulty in producing an addi- 
tional response. Yet there is evidence that a decrement in responding can 
occur even when no overt response causes it. There is the phenomenon of 
‘subzero” extinction, reported by Pavlov (1927). Pavlov showed that if 
we extinguish a classical conditioned response to the point at which it com- 
pletely disappears, and then continue to present the conditioned stimulus, 
these additional trials weaken the conditioned response even further (shown 
by spontaneous-recovery tests). This means that presenting the conditioned 
stimulus alone was effective in producing further reduction of response 
strength— it was not necessary for the animal to give the conditioned re- 
sponse. Thus, response-produced inhibition could not have occurred. 
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It is also difficult for the response-induced inhibition theory to explain 
data derived from experiments on nonresponse extinction. The first example 
of such extinction comes from Seward and Levy (1949). They trained 
rats to run down a runway to secure food placed in a goal box at the other 
end. After conditioning, one group of rats was placed directly in the goal 
box when it was empty of food and permitted to see that the reinforcement 
was no longer there. A second group of rats did not have the benefit of this 
preview. The next day, all the rats were given a series of extinction trials. 
The group that had been able to see beforehand that the food was no longer 
there extinguished approximately twice as quickly as the other rats. It was 
evident that the response had been reduced in strength by the sight of the 
empty box and therefore extinction took place without the running re- 
sponse. 

Several other experimenters have been able to find “extinction without 
resp)onding” (Deese, 1951; Hurwitz, 1955; Moltz, 1955), and others have 
not (Bugelski, Coyer, and Rogers, 1952; Sharlock, 1954); the effect may 
depend on factors that are not yet well understood. Those experiments that 
show nonresponse extinction add some interesting information to our knowl- 
edge of the extinction process. 

In one experiment (Deese, 1951 ) rats were trained to go to one side of a 
single-choice T maze. After the animals had been trained, half of them were 
placed directly in the empty goal box to let them see that there was no food. 
All the animals were then given a series of extinction trials. As in the previ- 
ous experiment, the animals that had been allowed to see that food was 
missing extinguished more rapidly. 

In addition, a test was made for spontaneous recovery from nonresponse 
extinction. One-half of the animals permitted to see the empty goal box 
were given their extinction trials immediately after the look into the goal 
box; the other half was tested 24 hours later. If there were spontaneous 
recovery from the inspection of the empty goal box, the group tested 24 
hours later should have been more inclined to choose the goal-box side of the 
T. However, there were no differences between these groups. A test of spon- 
taneous recovery from ordinary extinction in the same maze showed that 
there was indeed spontaneous recovery when the rats had to run through 
the maze to get to the empty goal box (Figure 11). These results suggest 
that there was little or no response-induced inhibition developed by the 
extinction without responding, while the spontaneous recovery from extinc- 
tion by responding might reflect the presence of inhibition induced by the 
running response. 
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The experiment by Moltz (1955) suggests that extinction without re- 
sponding is the result of the reduction of the secondary reinforcement value 
of the cues from the reinforced side of the maze. This leads to the proposi- 
tion that extinction involves the reduction in the strength of secondary cues 
to command ’ behavior when they are no longer associated with primary 
reinforcements. Hurwitz (1955) points out that, in the Skinner box, if one 
extinguishes early members of the response chain (approaching the lever, 
for example), one also extinguishes the later members (pressing the lever)] 



and he suggests that this may be due to the weakening of the goal gradient 
or gradient of reinforcement. 

Response inhibition and stimulus satiation. One of the facts that have 
tended to support the theory of response-produced inhibition is provided 
by studies which show that, given a free choice, animals prefer not to repeat 
responses they have just performed. For example, if rats are not reinforced 
tor gomg to either side of a single-choice T maze, they wUl tend to alternate 
choices and not repeat their last choice. If response-produced inhibition 
provides an explanation of alternation behavior, there are a number of de- 
ductions that can be made from the properties of response-produced inhibi- 
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tion. For example, decreasing the time interval between trials ought to 
increase the frequency of alternation. Likewise, increasing the effort in- 
volved in making a response should increase alternation. In another instance, 
forcing animals to go consistently to one side of a T maze for a while ought 
to make them tend to choose the opposite side. This tendency to choose the 
opposite should decline if an interval of time is allowed between the forced 
trials and the free-choice trials (reactive inhibition disappears with rest). 
In general, these deductions stand up to experimental test (Heathers, 1940; 
Zeaman and House, 1951; Riley and Shapiro, 1952). 

Despite this rather impressive array of evidence, investigators of animal 
behavior, other than learning theorists, have long held another explanation 
for the failure of responses with repeated elicitation. One investigator 
(Prachtl, 1953) studying the gaping response (the yawnlike opening of the 
mouth observed in young birds when the parent bird returns to the nest 
with food) showed that, while repeated elicitations of the response will 
exhaust it, changing the stimulus will make the response come back again. 
Actually, many experiments have shown the major cause of alternation and 
related phenomena to lie in stimulus conditions rather than in the repetition 
of responses per se. 

One simple experimental result seems strongly against a theory of alterna- 
tion exclusively in terms of response-produced inhibition. If we exchange 
the stimuli associated with each pathway in a two-alternative maze, rats 
will tend to repeat rather than alternate their choice of pathways (Glanzer, 
1953). This suggests that the rats go where there are “new” stimuli. Simi- 
larly, another experiment shows that rats alternate places they go to more 
frequently than specific responses in getting there (Montgomery, 1952). 

A rather exhaustive experiment by Rothkopf and Zeaman (1952) shows 
that the control of alternation in a T maze is not simple; they suggest that 
both inhibition from response and such factors as novelty of external stimu- 
lation control the tendency to alternate. Thus, while it is probable that 
response-induced inhibition plays a part in such phenomena as alternation 
of responses, it is certainly not the only factor, and perhaps not even the 
most important. 

The present status of the response-produced inhibition theory. Putting 
all the evidence together — evidence concerning the distribution of trials, the 
effects of effort on response tendency, alternation behavior, etc. — we must 
conclude that responding has an inhibitory effect upon future tendency to 
respond. Beyond this, however, we cannot go, for it is equally evident that 
this feedback from responding is not necessary to extinction. Furthermore, 
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It appears that there are rather severe logical problems in applying the 
notion of response-induced inhibition to extinction itself. 

The most serious logical problem in the response-induced inhibition 
theory as it is stated by Hull is that it implies that all behavior must eventu- 
ally cease, whether it is reinforced or not (Gleitman, Nachmias, and Neisser, 
1954). For even though habit strength for an> particular response will even- 
tually reach a maximal value, inhibition continues to be added for each 
response. As inhibition is added, so is conditioned inhibition, which thus 
continues to giow while habit strength remains constant. The net effect of 
this will be to make the response weaken and eventually disappear. It is 
true, as we saw above, that responses do sometimes weaken in strength 
even with continued reinforcement and, indeed, they may even eventually 
disappear (Calvin, et al., 1956), but this is the rare exception. 

The root of the matter is that the concept of conditioned inhibition is 
weak. As originally stated in Hull’s theory of learning, it leads to contradic- 
tory properties. We have seen that extinction can occur when conditioned 
inhibition based upon response-produced inhibition should be very weak. 
Therefore, it W'ould seem that the notion of conditioned inhibition is super- 
fluous. This would leave us with the idea that there is a temporary, fatigue- 
hke decrement produced by continued responding. This inhibition may have 
something to do with the organism temporarily ceasing to respond, but it 
cannot have much to do with the permanent effects of extinction. It is clear 
that, even if we do accept the general proposition of response-produced 

inhibition, we will need some concept that will help explain the permanent 
effects of extinction. 


CURRENT ALTERNATIVE THEORIES OF EXTINCTION 

Competition theory. The best-known alternative theory of extinction is 

that put forward by Guthrie (1935, 1952) and, to some extent, by Estes 

(1950). In a word, this theory attributes extinction of a particular response 

to the acquisition of competing response tendencies. Thus, when an animal 

ceases to respond in a particular situation, it is because it has learned not 

to do so by an association between “not doing so” and the stimuli. In some 

cases the animal will learn alternative responses and in others it will learn 
to do nothing. 

This leads Guthrie to say that there are three ways in which responses 
may be eliminated. The first method is to introduce the conditioned stimulus 
subhmmally, so that it will not elicit a response, and then to increase its 
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strength gradually. This is what we do when we train a horse to saddle by 
first putting on only a light blanket and then gradually working up to full 
gear. The blanket is not a sufficient stimulus to set off bucking, and the 
horse is not disturbed by the gradual addition of heavier loads. An experi- 
ment by Kimble and Kendall (1953) shows that this method of “toleration” 
works quite well in a simple avoidance habit in rats. A second method is 
to repeat the conditioned stimulus until the original response is exhausted 
and the organism is too fatigued to respond. This is the bronco-busting tech- 
nique and, incidentally, the special case handled quite well by the condi- 
tioned-inhibition notion. The third method is to present the conditioned 
stimulus when it is mechanically impossible for the organism to respond. 

Aside from providing this common-sense analysis of methods of eliminat- 
ing behavior, the theory of competition advanced by Estes has the merit of 
considerable logical elegance. For one thing, conditioning and extinction be- 
come simply different aspects of the same thing — they are both learning 
by a process of associating elements of particular responses with elements 
of stimulus complexes. Furthermore, as Estes (1955) points out, spon- 
taneous recovery under this theory is not a special phenomenon of inhibi- 
tion, but results from the fact that samples of stimuli conditioned (or ex- 
tinguished) on any one trial may not be present on the next; thus the 
tendency to respond can “spontaneously” change between trials.^ 

There are difficulties, unfortunately, with the competition theory, too. 
For one thing it becomes rather cumbersome when applied to the general 
cases of indirect extinction discussed under Extinction without Respond- 
ing. Furthermore, there are practically no simple relationships between the 
properties of conditioning and extinction, for a maximal rate of responding 
can sometimes be achieved in the Skinner box with only one reinforcement, 
while extinction after one reinforcement may require as many as 150 re- 
sponses before the rate of responding is returned to the unconditioned level 
(Skinner, 1938). In classical conditioning, extinction is usually more rapid 
than the original conditioning. 

These, however, are relatively minor objections, not nearly so serious as 
those to the theory making use of the concept of response-induced inhibi- 
tion. There has been a serious effort, moreover, to examine some of the 
experimental implications of the theory (see, for example, Lauer and 
Estes, 1955). Perhaps the most serious objection to the theory is not a 
logical one. That is, there is some reason for believing that extinction, in 

* For an experimental analysis of the role of spontaneous recovery in a formal com- 
petition theory, see Homme, 1956. 
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part at least, reflects factors other than loss of habit. Let us examine these 
in the context of other theories. 

The frustration theory of extinction. Many experimenters have ob- 
served that when an instrumental habit is extinguished animals tend to be- 
come emotionally e.xcited, as if they were frustrated by the withdrawal of 
reinforcement. Skinner ( 1938), for example, has noted this in rats under- 
going extinction and has attributed to it the comparative irregularity of 
extinction curves. More specilically, some investigators have assumed that 
extinction gives rise to a frustration drive and that this accounts for some 
of the effects one finds in a study of extinction. 

One simple runway experiment ( Adelman and Maatsch, 1955 ) shows that 
if rats are allowed to escape from the empty goal box during extinction, 
they do not show any loss for the originally reinforced habit of running 
down the runway. Of course, if these animals are confined in the goal box 
they do show a loss of response strength. One interpretation of this is that 
escaping from the frustrating conditions of extinction does not produce the 
same emotional aversion to the empty goal box as being confined in it. 

Another series of experiments (Lambert and Solomon, 1952; Lambert 
Lambert, and Watson. 1953) show an effect that is difficult to interpret ex- 
cept by assuming that extinction gives rise to a state of frustration. These 
expenments show that the closer an organism gets to the reinforcement be- 
fore It IS frustrated, the harder it is to extinguish a habit. In one of the ex- 
periments this effect was discovered by blocking rats at various points in 
a simple runway; the farther the block was from the goal box, the more 

sa“me eV t f experiment, essentially the 
saine effect was found with a token-reward habit in young chUdren. 

e interpretation of these experiments is that frustration is greater when 
the organism is blocked just before reaching the goal box than when it is 
blocked farther away. This frustration heightens the animal’s drive an^ 

Jab?’ interferes with either the acquisition of a new 

habit or learning not to respond.” 

It is probably true that the failure of reinforcement for many instru- 
mental acts does produce frustration, and that this has special eLcts on 
behavior as the above experiments show (also, see StaLy and Rowe 

54). Frustration does not provide an explanation of extinction itself’ 

eZrnallctfty""""" W 

Extinction as loss of incentive value. Strangely enough the hvnothi.».c 
that IS probably the simplest has received the least attention.’ This ifth^no- 
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tion that extinction has little or nothing to do with habit or learning, but, 
rather, produces a reduction in response strength by changing the value of 
the incentive (Johdai, 1955). The effects of extinction, according to this 
argument, can be considered simply as the limiting case of the reduction in 
size of reinforcement after learning has taken place (Dufort and Kimble, 
1956). Given the fact that behavior varies according to the size of rein- 
forcement, it seems a theoretical extravagance, as Dufort and Kimble point 
out, to assume that completely different processes take place in extinction — 
reduction of reinforcement to zero after previous reinforcement. 

In the last chapter we discussed another special case of the absence of re- 
inforcement under latent learning. In the experiments on latent learning we 
saw that animals could leam something about a maze even though they 
were not specifically reinforced in the goal box. The fact that the animals 
had learned the maze, however, was not evident until some incentive was 
given them. In the same way, if we extinguish a maze habit by withdrawing 
reinforcement after the maze has been learned, the fact that animals will 
now make more errors (see Tolman and Honzik, 1930) does not mean 
that they have unlearned the correct path to the goal box. Rather, the rats 
appear to have no interest in getting to the goal box. There are, of course, 
many implications in considering extinction to be a special case of zero 
amount of reinforcement. Many of these implications were discussed in the 
previous chapter under amount of reinforcement. 

The present state of theories about extinction. We have seen that there 
is a temporary negative feedback effect on responding resulting from pre- 
vious responses. This effect provides the basis for the concept of response- 
produced, or reactive, inhibition. Reactive inhibition disappears with rest, 
however, and except for special cases, it is doubtful whether it has much to 
do with the more permanent effects of extinction. In addition there is an 
analogous phenomenon in stimulation (Hinde, 1954; Glanzer, 1951); that 
is, all other things being equal, animals tend to avoid stimuli they have just 
been subjected to. It is likely that this “stimulus satiation,” like reactive in- 
hibition, is largely temporary in nature, though this issue has not been 
thoroughly explored. 

Furthermore, responses sometimes disappear because organisms learn 
new responses that displace the old ones. In classical conditioning this may 
mean that the organism acquires a new response to stimuli already condi- 
tioned to another, incompatible response. In instrumental conditioning it 
may mean that the organism learns a new method for receiving the same 
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reinforcement or that it learns that a different set of stimuli is not corre- 
lated with the occurrence of reinforcement. 

There is one important question about this matter of extinguishing an old 

response by learning competing activity. It is not clearly established that 

learning the new response necessarily results in “unlearning'’ the old one. 

The theories advanced by Guthrie and Estes suggest that this is so, but 

there are good grounds for assuming that it is not always true. It is clear, at 

least, that learning a new incompatible response does not always reduce the 

old response to its original, unlearned strength. Those theories that reduce 

all conditioning to the paradigm of classical conditioning would suggest that 

this is so simply because all of the cues that were conditioned to the old 

response are not conditioned to the new one, so that stimuli conditioned to 
the old response are left over. 

At any rate, one of the things that should soon be settled is whether com- 
peting responses always result in the unlearning of the originally established 
response or simply have a temporary effect on performance. If the latter is 
true, when the competing response disappears, the original response should 
spontaneously recover in strength. There is some experimental evidence 

^ situations (Whiting and Mowrer, 

1943). This is a complicated question, and we shall see it again when we 

consider discrimination and conflict in animal behavior, and transfer in hu- 
man verbal behavior. 

In addition to the factor of competition, it is possible that a large number 
of the examples of extinction studied in the laboratory (as well as many 
problems in animal behavior in nature) are the result simply of the change 
in incentive or reinforcement. After our diseussion in the last chapter we 
should suspect that such examples of extinction primarily involve changes 
in what animals are willing to do rather than in what they learn. Thus, many 
reductions in strength of response by nonreinforcement involve reduction in 
the momentary tendency to respond rather than any basic habit patterns 


PARTIAL REINFORCEMENT 

We have seen that learned behavior is maintained in high strength by 
reinforcement, and that if it is not reinforced, it declines in strength This 
effect of reinforcement determines wha, animals learn, since they cannot 
earn m situations in which they do not have some experience; the occur- 
rence of reinforcement determines the kinds of experiences organisms are 
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going to have. Thus far we have considered only two particular cases of re- 
inforcement, one in which occurrence of a response is reinforced, and the 
other in which there is no reinforcement at all. It is obvious, however, that 
there may be intermediate cases. We could, for example, reinforce every 
other occurrence of a response. In general, any case in which we reinforce 
behavior some, but not all, of the time is an example of partial reinforce- 
ment. The schedule of reinforcement is an extremely important variable in 
the control of behavior, and it will be worthwhile to examine this problem 
in some detail. 


SCHEDULES OF REINFORCEMENT 

In general, there are two ways we can control the patterning of reinforce- 
ments and responses. In the first case we can vary the time between rein- 
forcements irrespective of the number of responses between reinforcements. 
Thus, we may arbitrarily decide to reinforce once every five minutes, re- 
gardless of whether the animal responds one hundred times or only once 
during the five minutes. The other way in which we can change the pattern 
of reinforcements is to vary the frequency of responses between reinforce- 
ments. In this case, we can decide to reinforce every tenth response, whether 
10 responses take five minutes or one. In using these two methods, we can 
alter the reinforcement schedule so that it is either haphazard or highly sys- 
tematic. In other words, we can reinforce randomly after one, three, or five 
minutes, or reinforce systematically every minute or every three minutes. 
Since these different ways of scheduling reinforcements have somewhat dif- 
ferent effects, Jet us now look at them briefly. 

Fixed-interval reinforcement. Fixed-interval, or periodic, reinforce- 
ment occurs on a fixed time schedule. This kind of reinforcement has a par- 
ticular and systematic effect on previously learned behavior. When an ani- 
mal becomes accustomed to a fixed-interval reinforcement schedule for, 
say, pressing a lever, its behavior becomes stable. In general, the animal 
will emit a constant number of responses per reinforcement. If, for exam- 
ple, the animal gives an average of 20 responses for each reinforcement and 
it is being reinforced once a minute, its rate of responding will be 20 re- 
sponses per minute. If, however, the animal is reinforced once every two 
minutes, these 20 responses will be spread out over that period, so that its 
rate of responding will be only 10 per minute. The general rule is that the 
rate of responding is inversely proportional to the interval between rein- 
forcements (Skinner, 1938, 1950). 
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Variable-interval reinjorcement. The other extreme of the interval 
schedule is to administer the reinforcement at random times. Thus, after an 
animal has learned a particular response, we might reinforce it the first time 
after one minute, then after thirty seconds, then after three minutes, etc. 
Under these conditions behavior becomes extremely stable, the animal re- 
sponding with almost machinelike precision. 

Fixed-ratio reinforcement. In fixed-ratio reinforcement the reinforce- 
ment IS regularly delivered after a specific number of responses. Here, as we 
might expect, the effects on performance are dramatically different from 
those of fixed-interval reinforcement. First of all. reinforcement on a ratio 
schedule places a premium on responding rapidly, for the higher the rate of 
responding, the higher the rate of reinforcement. If an animal is being rein- 
forced once every 10 responses, it can get a reinforcement every 10 seconds 
by responding once per second. Even more characteristic of fixed-ratio rein- 
forcement IS the fact that the rate of responding is variable. Immediately 
after a reinforcement it is low, and it gradually increases until the next rein- 
forcement occurs. Since this acceleration occurs as the ratio of reinforce- 
ments to responses gets low, unlike interval schedules, fixed-ratio schedules 
produce higher average rates of responding with fewer reinforcements. 

Variable-ratio reinforcement. The variable-ratio reinforcement case is 
perhaps the most interesting, because it is probably more characteristic of 
the conditions in which natural behavior occurs. Variable-ratio reinforce- 
ment is administered after different numbers of responses. If we reinforce a 
well-learned habit on a variable-ratio schedule, we do not get the increase 
m rate of response between reinforcements obtained with fixed-ratio rein- 
forcement since the animal cannot predict the exact number of responses 
necessary for a reinforcement; there is a premium on responding rapidly 
for getting a reinforcement eventually depends upon the number of tries’ 

The effects associated with particular ways of administering reinforce- 
ment are all effects on performance— they are things that will happen to 
a well-learned response when a partial reinforcement schedule is introduced 

Let us now look at the effects of partial reinforcement upon the specific 
conditions of acquisition and extinction. 

EFFECTS OF PARTIAL REINFORCEMENT ON 
LEARNING AND EXTINCTION 

The effects of parttal reinforcement on acquisition. If we tried to train 
a rat to press a lever in the Skinner box by reinforcing it once for every 10 
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responses, we would have a long wait before the animal learned the re- 
sponse. In general this would be true in any situation in which the major 
thing to be learned is the relationship between a response and its conse- 
quence — the reinforcement. Since learning such a relationship is the essence 
of most studies on simple instrumental conditioning, it is not surprising that 
a review of the literature on partial reinforcement (Jenkins and Stanley, 
1950) shows that most studies reveal a tendency for anything less than 100 
per cent reinforcement to slow the rate of acquisition. In other words, rein- 
forcing simple instrumental acts part of the time rather than all of the time 
appears to slow down the rate of learning. 

This retarding effect of partial reinforcement is probably the indirect ef- 
fect of the many conditions that determine an animal’s performance in a 
given situation. We saw in the last chapter that such factors as delay of re- 
inforcement, amount of reinforcement, etc., work primarily on performance 
rather than on rate of learning; any effect upon rate of learning of these 
reinforcement variables is indirect, determined by the fact that performance 
is frequently a major determiner of what animals learn. Much the same 
thing is true of the schedule of reinforcement; any effect it has upon learn- 
ing is probably achieved through an alteration in the nature of the instru- 
mental act or of the stimulus-response sequence that the animal must learn. 

The effect of partial reinforcement on resistance to extinction. The 
most dramatic effect of partial reinforcement is on resistance to extinction. 
All schedules of reinforcement affect resistance to extinction somewhat dif- 
ferently, but it is a general rule that partial reinforcement will nearly always 
produce greater resistance to extinction than 100 per cent reinforcement 
(see Jenkins and Stanley, 1950). 

Earlier in this chapter we saw that the number of times a particular ac- 
tivity had been reinforced determined, up to a limit, how many times the 
activity would be performed in the absence of reinforcement. The maximal 
resistance to extinction after a 100 per cent reinforcement schedule is not 
great. No matter how many times we reinforce lever pressing in the rat, it 
is unlikely that the animal will respond much more than 200 times without 
reinforcement. Thus, habits established and maintained with 100 per cent 
reinforcement are not very resistant to extinction. 

Results obtained by Skinner (1938) on instrumental behavior and by 
Humphreys (1939) on a classical conditioned response showed that a sched- 
ule of partial reinforcement greatly increased resistance to extinction. Ex- 
cept for very special conditions, nearly every experiment since then has 
shown similar results. Not only does partial reinforcement make a habit 
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more resistant to extinction, but one experiment on maze behavior in the 

rat shows that such reinforcement makes it more difficult to learn a new 

conflicting habit (Wike, 1953). The large effects of partial reinforcement 

on extinction may be seen in Figure 12 (Jenkins, McFann, and Clayton 
1950). ^ ’ 

This particular effect of partial reinforcement is of the greatest theoretical 
and practical importance. For one thing, it enables us to understand why 
behavior in natural settings is likely to be so persistent — most behavior is 



Figure 12. Cumulet^e extinction curve, after partial and continuous reinforcentent. For the par- 
ol reinforcement condition, the animals vrere placed on a yarioble-intervol schedule The animals 
m both groups received 200 reinforce before extinction began. (Tenlrin.. McFonn. and C/a, ton! 


probably not reinforced for each try. We also know that the most efficient 

use of reinforcement is obtained by a partial reinforcement schedule (for 

most purposes, a variable-ratio one), probably after an initial group of 100 

per cent reinforced responses; In this way we conserve the strength of rein- 

orcements that are likely to suffer from satiation effects (such as new toys 

do for an infant), and, at the same time, we produce the largest resistance 
to extinction. 

There is apparently no limit to the extent to which a schedule of very in- 
requent reinforcement can increase resistance to extinction. Even in the 
laboratory It is possible to produce habits that will resist the extinguishing 
effects of thousands of unreinforced responses. As a matter of fact Skinner 
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(1950) suggests that if conditions are proper it is possible to produce an 
extinction curve with no curvature; in other words, the animal goes on re- 
sponding without reinforcement ad infinitum. Incidentally, Skinner remarks, 
such a result is a telling blow against a theory of extinction built upon the 
accumulation of a response-produced inhibition. 

Some theoretical importance was originally attached to the possibility 
that wider spacing of trials might have the effect of reducing the heightened 
resistance to extinction brought about by partial reinforcement (Sheffield, 
1949). Subsequent experimental work, however, has established that such 
a reversal does not always occur under spaced practice, and indeed proba- 
bly does so only under very special circumstances (Grant, Schipper, and 
Ross, 1952; Weinstock, 1954). 


INTERPRETATIONS OF PARTIAL REINFORCEMENT 

We have seen that the importance of partial reinforcement is twofold. 
First of all, a comparison of different schedules of reinforcement showed 
that momentary characteristics of response strength depend upon the par- 
ticular schedule of reinforcement under which an organism is performing. 
Secondly, we saw that partial reinforcement generally has the effect of in- 
creasing the resistance to extinction. This is perhaps the most fundamental 
point, and it is the one that has received the most attention from theorists 
and been applied most to theories of personality and development. Part of 
the interest in the effects of partial reinforcement has centered in efforts to 
explain why partial reinforcement so greatly prolongs the extinction process. 
Theories about these effects of partial reinforcement fall into two general 
classes stimulus theories and response theories. 

Stimulus Theories 

Stimulus theories of partial reinforcement say that resistance to extinction 
is greater after such a schedule because the stimulus situation is different 
from that in 100 per cent reinforcement. Reasons for the effect of increas- 
ing resistance to extinction are spelled out in several ways. Let us look at 
some of these. 

Expectancy. The notion of expectancy says that animals come to ex- 
pect, or predict, stimulus situations associated with particular responses. 
Thus, a rat reinforced 100 per cent of the time for pressing a bar comes to 
expect food after responding. If an animal is reinforced only part of the time 
during training, it will not expect a reinforcement on every occasion, so that 
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when reinforcement stops altogether, this animal is likely to be more per- 
sistent. 

This concept of expectancy has been roundly criticized on several counts. 
Some critics have claimed that it is an anthropomorphic notion. Others 
have pointed out that the definition of expectancy is vague and ambiguous. 
Certainly, the criticism of anthropomorphism is justified. Human learners 
are capable of expectancy — as a form of verbal behavior. We can tell our- 
selves when we ought to expect something to happen. It is doubtful that 
animals are capable of expectancy in this sense. 

The most serious criticism of the notion of expectancy is that it is an 
ad hoc explanation. It is acceptable enough in itself if, for example, it is 
defined as the ability to respond to a conditioned cue before the uncondi- 
tioned stimulus appears. When the buzzer sounds the dog salivates; then 
the meat powder is placed in its mouth. If this is all one means by expect- 
ancy, there is no quarrel. This definition of expectancy (and it is nothing 
more than a definition) does not provide us, however, with a very powerful 
explanatory tool. We may apply it to the problem of partial reinforcement, 
and by making some further incidental definitions, come out with the pre- 
diction that partial reinforcement will lead to greater resistance to extinc- 
tion than 100 per cent reinforcement. 

We have dealt rather harshly with this idea of expectancy. Later in this 

book we shall examine questions about decision making in human beings 

in which the notion of expectancy has been considerably more useful than 

It has been in the theory of simple instrumental and classical conditioned 
responses. 

Discrimination. Another simple idea used to account for the results of 
partial reinforcement studies states that under partial reinforcement sched- 
ules it is less easy for the animal to discriminate between conditions of re- 
inforcement and those of extinction. This implies that the major determiner 
of the rate (and probably the limit) of responding during extinction is the 
extent to which the situation during extinction is the same as that during 
original reinforced training. In partial reinforcement there are occasions 
when responses go unreinforced; consequently, when extinction is eventu- 
ally begun it is more difficult for the animal to discriminate between the 
conditions of reinforcement and those of extinction. 

It would seem that this discrimination notion is quite similar to a general 
concept of expectancy. The difference is that the discriminability hypothesis 
IS stated m such a way as to preclude our attributing symbolic, human proc- 
esses to rats as they generate extinction curves. The word expectancy has 
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too much excess meaning for most people. Perhaps it is the better part of 

scientific caution to accept the discriminabUity notion as part of the ex- 

p anation for the partial reinforcement effect and to dismiss the concept of 
expectancy. ^ 

The notion of discriminabUity has other advantages, as we shaU see later. 
The prmcipal of these is that we know some of the rules for discrimination 
earning (see Chapter 4) and can use them to predict the specific effects of 
p rticu ar patterns of partial reinforcement. At any rate we can accept the 
notion that the increased resistance to extinction resulting from partial rein- 
forcement can be considered as partly due to a change in the discrimina- 
bUity between the conditions of reinforcement and those of extinction. 


Response Theories 

other way to look at the question of what produces the increased re- 
r ^ extinction with partial reinforcement is to consider the definition 

that ^ most purposes we define the unit of response as 

lf>vp \ produce a reinforcement. Thus, in the Skinner box, it is a 
ho Jt^^i operating the feeder. It does not make any difference 

tok ^ pressed, as long as it is pressed hard enough. By the same 

partial reinforcement situation the unit 
feeH Tf ^ ^ ^i^rnber of lever presses necessary to activate the 

^ the unreinforced lever 

forcemp ^ reinforced ones to be part of the activity necessary for rein- 

reinforr^ response. Thus, if on a variable-ratio 

10 Ipv ^ reinforced 1 lever press in 10, we would count 

schediilp ^ response. On this variable-ratio reinforcement 

forcemeat sche''dLr"''"' equivalent to 1 on a 100 per cent rein- 

sh^ed advanced by Mowrer and Jones (1945), and they 

sDonspc o ^ responses are counted by this scheme, actually fewer re- 
course extinction after partial reinforcement, though, of 

cur artnipH ^ Presses occur. The reason that fewer responses oc- 
tion is ind ^®^es, is that a stronger response-induced inhibi- 

It would u ^ activity required under partial reinforcement, 

ereater mteresting to see if this were also accompanied by 

question ^ recovery, but Mowrer and Jones did not examine this 

hypotheses of^e^ P^'^sented this notion as an effect upon responding and the 
iiypoineses Ot eXDectanrv a:^ : • . ... — . .. 
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is, in the last analysis, largely a matter of convenience. Furthermore, there 
is nothing basically incompatible between these notions as they stand. It is 
possible that the unit of response as far as the organism is concerned is de- 
termined by what it “expects” or by the occurrence of a reinforcement. At 
any rate, while a more adequate explanation of the effects of partial rein- 
forcement upon extinction may eventually be stated, it is probable that it 
will be close to those discussed above. 

Importance of partial reinforcement. The effects of partial reinforce- 
ment — particularly on extinction — are extremely important when applying 
the principles of learning and behavior theory to the understanding of be- 
havior under natural conditions. Most habits seem to be highly resistant to 
the effects of extinction, and this is very likely because behavior in natural 
life is reinforced on a probable rather than a certain basis. Thus, the young- 
ster who is beginning to get a command of language is likely to be persistent 
about asking for cookies, ice cream, etc., because, in the course of things, he 
may be reinforced on a variable-ratio schedule. Sometimes these variable- 
ratio reinforcements produce behavior which is highly resistant to the effects 
of extinction. Parents who are concerned about controlling the behavior of 
their offspring will sometimes try to introduce fixed-interval reinforcement. 
Thus, some children are supposed to be able to buy ice cream on Mondays 
and Thursdays but not on Tuesdays, Wednesdays, and Fridays. Since par- 
ents are not nearly so consistent as the schedules of reinforcements set up 
for rats in Skinner boxes, the discriminations based on fixed-interval rein- 
forcement programs that appear in laboratory rats do not always appear to 
be clear-cut in children. 

Thus, inconsistency in the application of reinforcement is the stuff of 
which persistent behavior is made. How much an inconsistent application 
of reinforcements (and punishments) is responsible for a generalized trait 
of persistence we do not know. Not very much, it would seem, for eveh 
though parents differ enormously in the degree to which reinforcement and 
punishment are consistently applied, the rest of the world is likely to be 
much the same in this respect. Each youngster, at least in the easygoing at- 
mosphere of American culture, is likely to have been exposed to a fairly 
consistent pattern of inconsistency in the application of reinforcement. 

Partial reinforcement in choice and decision. A problem that has re- 
ceived considerable attention recently is that of finding out how irregular 
patterns of reinforcement affect decisions. More generally, this has been 
conceived of as a problem in the probabilistic nature of events that influence 
human and animal behavior. Brunswik (1954) for many years insisted that 
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the major variables in psychology were to be considered as only probabilis- 
tic, not certain, in nature. For instance, a cue for visual depth, or size, of 
retinal image, does not always yield unequivocally correct information. 
Objects that loom large in our visual field are probably objects close to us 
in space, but not necessarily so. So it is with reinforcements; they occur in 
a probabilistic way, seldom in an all-or-nothing way. 

With this general view in mind, Brunswik (1939) reported an experi- 
ment on rats in a T maze in which most of the time reinforcements occurred 
on the right side but sometimes occurred on the left. Under these condi- 
tions, rats came to distribute their choices according to the probability of 
reinforcement. Thus, if the right side were reinforced 75 per cent of the 
time, and the left side only 25 per cent of the time, the rats would choose 
to go to the right side 75 per cent of the time. 

Of great current interest in the theory of behavior is what determines the 
likelihood of an organism making a particular choice given the probability 
but not the certainty of a particular event happening. Most experimental 
research has been directed toward verbal choice in adult human beings. In 
a typical experimental situation, a subject is presented with a panel on which 
are two lights and two buttons. He is to predict by pressing a button which 
of the lights is going to flash on a particular trial. The experimenter has ar- 
ranged a particular sequence of order for flashes on the right or left. As 
with Brunswik s rats in the T maze, human beings apparently choose the 
alternatives in proportion to the frequency of the to-be-predicted event 
happening (Grant, Hake, and Hornseth, 1951; Estes and Straughan, 1954, 
etc.). This is a puzzling situation, for it is not the optimal strategy. We are 
not surprised, perhaps, that rats do not maximize the probability of receiv- 
ing a reinforcement, but we are perhaps a little surprised that college stu- 
dents do not maximize the probability of being right (given a situation in 
which the frequency of alternative events is not even the best strategy is 

always to bet on the more frequently occurring event, if we have no further 
information). 

It has been argued that this is a consequence of the learning mechanisms 
(Estes and Straughan, 1954; Bush and Mosteller, 1955), but if this is so 
it is an oversimplification. Hake (1955) has reviewed much of the literature 
on this topic and concluded that the conditions determining probability of 
a human subject making a given choice between two alternatives are influ- 
enced by several variables. There are reasons for believing, he points out, 
that in any such choice the subject is as much influenced by short sequences 
of events just before a particular choice as he is by the over-all frequency 
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of past events. Also, to some extent, subjects are influenced by the nature of 
their previous choices. Hake points out that the matching of objective 
probability is a result; it does not necessarily describe what individuals do, 
is this true of Brunswik’s rats? It is perhaps too early to arrive at a con- 
clusion. There are superficial parallels between the behavior of nonverbal 
subjects such as rats, pigeons, and young children under partial reinforce- 
ment, and the behavior of human adults in choice games with varying prob- 
abilities of making the right guess. Beyond this we can say little; we do not 
know if the parallels are anything more than superficial, and it is too early 
even to guess. We do know, however, that these experiments on human 
choice behavior that resemble certain experiments on partial reinforcement 
in animals so much are looking into important questions, and may eventu- 
ally have serious implications for all matters that involve human beings 
making decisions or choices. 


CHAPTER 4 





DISCRIMINATION AND LEARNING 

The chief function of learning for most animals in the world is to enable 
them to emit the right response at the right time. This generally means per- 
forming a response, on signal, that is already a part of the basic behavior 
of the animal. Most organisms seldom if ever find it necessary to learn 
skilled acts; much of the behavior (considered purely as response) of ani- 
mals IS unlearned. They must frequentiy learn, however, under what cir- 
cumstances certain behavior is appropriate. It is the circumstances of emis- 
sion rather than the behavior per se that require learning in most animals. 
To be sure, m the laboratory we sometimes require rats to perform skilled 
acts, such as dropping marbles down a narrow chute, but this kind of thing 
IS usually highly unnatural. More typically we take advantage of something 
a rat does more readily, such as exploring a narrow alley. Consequentiy a 
good deal of the learning a rat does in the laboratory involves simply the 
discrimination of stimuli in the environment and attaching appropriate re- 
sponses to these stimuli. Some psychologists have said that in this sense all 
learning is discrimination. 

Much of mnate behavior, particularly in the lower organisms, is highly 
discriminative; some instinctive act is likely to be emitted only in the pres- 
ence of a rather precisely defined (unconditioned) stimulus. This has al- 
ways been more or less understood, but in recent years we have greatly ex- 
tended our knowledge of the precision with which particular stimuli set off 
patterns of innate action. For example, in the three-spined Stickleback fish, 
a very crude model of a female will elicit the innate pattern of mating be- 
havior in the male, providing the model is painted red on its underneath 

surface (Tinbergen, 1951). Here the critical feature of the stimulus seems 
to be the red color. 

In this chapter we shall look at the way in which animals learn to dis- 
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criminate. Since our main purpose is to understand the nature of learning 
— particularly as it applies to the human animal — we shall ignore some 
fascinating accounts of unusual discriminations in nature in favor of a de- 
tailed treatment of the process of learning to discriminate. We should note 
that this is not a process that is limited to learned stimuli; some uncondi- 
tioned stimuli have their effects modified through discriminative training 
(see Tinbergen, 1953). 

Before we look at discrimination itself, however, let us take a brief look 
at its opposite, generalization. Here we ask, How much variation can we in- 
troduce into a response-producing stimulus before the response disappears? 

GENERALIZATION 

While investigating the conditioned salivary response in dogs, Pavlov 
soon discovered that the conditioned response was not limited to the stim- 
ulus originally conditioned. It would spread to other, related stimuli. Ex- 
periments from Pavlov’s laboratory (Pavlov, 1927) established the fact that 
if tactile stimulation of a small area of a dog’s skin is used as a conditioned 
stimulus, stimulation of other areas of the skin will also elicit the condi- 
tioned response, though the effect diminishes with the distance from the 
original conditioned stimulus. This important effect is what is now known 
as stimulus generalization.^ The two important points about stimulus gen- 
eralization are ( 1 ) that the effectiveness of conditioning is not limited to 
the stimulus originally conditioned, and (2) that the ability of a stimulus to 
elicit a conditioned response decreases with distance from the original con- 
ditioned stimulus. Since both of these points are basic to the psychology of 
learning, let us look at some experimental evidence about them. 

Generalization of classical conditioned responses. The experiments 
from Pavlov’s laboratory are, of course, experiments on the generalization 
of classical conditioned responses. Let us, however, look at some experi- 
ments from laboratories that have used other classical conditioned responses 

and have more carefully controlled the quantitative relations between 
stimuli. 

In one experiment, the investigators conditioned the galvanic skin re- 
sponse in man (elicited by an electric shock) to a vibrator applied near the 

1 “Irradiation” and “induction” are some of the other terms that have been used 
to describe stimulus generalization. 

2 The galvanic skin response (GSR) is the electrical activity of the sweat gland cells 
when they are excited. These glands are largely innervated by the sympathetic branch 
of the autonomic nervous system; hence this response is an “emotional” one. 
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shoulders (Bass and Hull, 1934). The strength of the conditioned response 
was then tested with various stimuli placed at increasing distances from the 
original conditioned stimulus. The results of this experiment are shown in 
Figure 13. The conditioned galvanic skin reaction was greatest when it was 
nearest the criginal conditioned stimulus, and diminished in size as the 
stimuli increased in distance from the original conditioning site. 



STIMULI 

Figur. 13. Cneraliiafion of a eondifion«f galvanic .kin r..pan« .a different rtimuii. S«n,uiu. 1 
the condrlioned .fimuiu. u.ed during training. Aii rtimuii used for testing differed from the 
conditioned rtimuio. oniy in iocation. (Data from Bass and Hall. 1934.) 

There have been many expieriments on the generalization of the condi- 
tioned galvanic skin response to a variety of stimuli— auditory, tactual, and 
visual (Hovland, 1937a, 1937b; Littman, 1949; Grant and Schiller, 1953; 
etc.). Most of these studies were not performed with the idea of establish- 
ing the validity of the principle of generalization — there seems to be little 
doubt about that but they were attempts to establish precisely the nature 
of the generalization gradient. The reason for all this experimental activity 
IS that there has been a widely accepted notion that there must be some 
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basic mathematical law governing the amount of stimulus generalization 
(Hull, 1950). Therefore a large number of experiments have been performed 
in an effort to discover the nature of this law. The sad outcome of all of 
these experiments suggests that there is not any such law. Rather, empiri- 
cally determined gradients of generalization take a variety of forms and 
have many different slopes. Some of the possibilities for these are illus- 
trated in Figure 14. 

Actually there are good reasons why an exact mathematical law of stim- 
ulus generalization has not been found. Some of these are purely mechani- 
cal. For example, even for the case of the galvanic skin response, there are 



Figure 14. Some theoretical possibilities for generalization curves. Each curve would be expressed 
by a diff6r6nt mothemoticai equation. 

many ways in which the response can be measured — as electrical resistance, 
as current flow, as electrical potential, and, if a-c circuits are used, as other 
quantities. Unfortunately there is no rational choice for the basic measure- 
ment that should be employed. This precludes a fundamental mathematical 
law of stimulus generalization for even so restricted a case as the galvanic 
skin response. When we try to generalize a mathematical law to other re- 
sponses, salivation, for example, the situation becomes even more hopeless, 
for here we introduce other arbitrary ways of measuring. 

Some theorists maintain that if probabUity of response occurrence is 
used as the sole psychologically meaningful measure of behavior, we can 
then establish a mathematical law of generalization. The difficulty is that 
there is no unequivocal and psychologically meaningful way of stating the 
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units on the sensory dimension along which generalization is supposed to 
occur. Bush and Mosteller ( 1951 ) do suggest an index of similarity between 
stimuli, but they are unable to provide any way of rationally relating this 
index to the physical characteristics, such as size, distance, and intensity 
that we use in measuring stimuli. Thus they are unable to suggest an exact 
mathematical statement of the principle of stimulus generalization. It is 
possible that this problem wUl soon be solved, but at present we must be 
content with the qualitative principles that stimuli do generalize their con- 
ditioned effects and that they generalize roughly to the extent that other 
stimuli are related to or resemble them. 

Generalization of instrumental conditioned responses. Generalization 
in c assical conditioning is a relatively straightforward matter, since in clas- 
sical conditiomng some stimulus is always used to elicit the conditioned re- 
sponse. In instrumental conditioning, however, we do not always attach a 
response to some particular stimulus, though it is usually perfectly feasible 
to do so. We can, for example, train a rat to press a lever in the Skinner 
box only upon signal. Such training ordinarUy involves some discriminative 
learning and generalization which comes after discrimination learning is 
quite different from that which comes before. Consequently in most instru- 
mental learning we look at generalization only after some discriminative 
learning has gone on; in order, however, to make a more exact parallel with 
classical conditioning, let us examine a case of stimulus generalization in 

instrumental conditioning in which there has been little or no discrimina- 
tion training. 


In this study (Grice and Saltz, 1950) rats were trained to run down an 
alley and obtain food by pushing back a small door. The door was embed- 
ded in the stimulus to which the rats were trained; it consisted of a white 
circ e on a black background. The rats were trained to run to a stimulus of 
a certain size, say, 79 square centimeters, and then were tested with circles 

sizes (for example, 50 square centimeters). The test consisted 
in n ing out how many times the rats would run during extinction (no 

reinforcement). 

The results of the experiment are shown in Figure 15. As we can see, 
well-defined gradients of generalization occurred. The more remote the size 
of the test stimulus from the training stimulus, the fewer the responses dur- 
ing extinction. Thus it is clear that if a reinforcement is associated with a 
particular stimulus during instrumental training, some of that association 

will spill over” to stimuli related to, but not the same as, the stimulus 
originally conditioned. 


DISCRIMINATION AND LEARNING 


79 


The nature of generalization. The existence of gradients of stimulus 
generalization both for classical and instrumental conditioned responses is 
not in doubt. There has been, however, considerable argument over the na- 
ture of these gradients. Lashley and Wade (1946), for example, have ar- 
gued that they simply represent absence of opportunities for discrimination. 
In other words, they argue, if animals are not allowed to compare stimuli 
simultaneously, they find it difficult to discriminate between closely related 



AREA OF TEST STIMULUS IN SQUARE CENTIMETERS 

Figure 15. Generalization gradients for an instrumental response. The two curves show the mean 
number of responses during extinction to various test stimuli. The solid curve shows generalization 
after training on the 20 square millimeter stimulus, and the dotted curve shows the same thing on 
the 79 square millimeter stimulus. (Grice end Salfz, 1950.) 

stimuli. Thus, generalization does not represent a spill-over of the influence 
of habit so much as it does the failure to discriminate along some sensory 
dimension. The fact is, as we shall see later, that animals can learn to dis- 
criminate stimuli even when they are presented successively rather than 
simultaneously, so any failure of discrimination in experiments on general- 
ization more than likely represents a failure of learned discrimination, not 
a failure in sensory capacity for discrimination. 

It is probable that the particular generalization gradient exhibited by an 
organism under test in the laboratory, even if no specific discriminative 



80 


THE PSYCHOLOGY OF LEARNING 

training has been given, represents the effects of the history of an organism 
with sensory learning. Even a laboratory rat that has been caged in an un- 
eventful animal colony all its life has had some opportunity for discrimina- 
tive leammg, and this probably has some effect on laboratory tests. We do 
not know much about these effects, but ever since the appearance of Hebb’s 

book The Organization of Behavior (1949), learning theorists have become 
increasingly aware of their existence. 

Extinction also generalizes. We have seen that the effects of reinforce- 
ment generalize to stimuli other than those specifically used in training. As 
might be expected, the effects of extinction also generalize in this way. Let 
us Ulustrate this with an example, from classical conditioning (Hovland, 
1937a). Suppose we condition a galvanic skin response in human subjects 
by pairing an electric shock with a tone of a certain frequency. After con- 
ditioning we shall, of course, find that another frequency will elicit the re- 
sponse. Suppose we then extinguish the conditioned response to the second 
tone. If we then go back and test with the first tone, we shall find that ex- 
tinction of the response to the second tone also reduced the tendency to re- 
spond to the first. Thus the effects of extinction generalize as do the effects 
of reinforcement. 

The generalization of the effects of both reinforcement and extinction 

are extremely important in the theory of discriminative learning, and a 

large portion of this chapter, as well as later chapters, will be devoted to the 

application of the principles of stimulus generalization to an understanding 

of discnminative learning and of related questions in the psychology of 
learning. ^ 


DISCRIMINATIVE LEARNING 

K niake discriminations can be looked upon as the process of 

breaking down generalizaUons. This may be achieved by reinforcing re- 
sponses to one stimulus and extinguishing responses to another. Discrimina- 
tion is achieved when the organism comes to respond to one stimulus and 
not to the other. 

There are a variety of specific procedures for producing discriminative 
leammg m laboratory animals. One of the best known of these involves the 
use of the Lashley jumping stand. An example of this device is shown in 
Figure 16. In studies of discriminative learning using the jumping stand, 
rats are trained to jump from a platform about 8 inches from the stand to 
one of two doors in the stand. Light pieces of cardboard are placed over the 
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doors, and these carry the stimuli. The problem illustrated in Figure 16 re- 
quires the rat to discriminate between vertical and horizontal stripes. If the 
rat makes the correct choice, the door falls and the rat is allowed to eat 
food placed inside the stand. If it makes the wrong choice, the door does 
not move and the rat falls to the net below. Thus, typically, the Lashley jump- 
ing stand makes use of punishment. 

In other types of discriminative studies single choice-point mazes and 



Figure 16. The Lashley jumping stand. The rot jumps from the stand to one door or the other. If 
the choice !s correct, the stimulus card gives way; if the choice is wrong, the rat falls to the net 
below. {Lashley, 1935.) 

runways have been used. In order to study discrimination in animals not 
ordinarily seen in the psychological laboratory, birds and reptiles, for ex- 
ample, special techniques must be devised. The fundamental rule remains 
the same however; animals are required to choose between a stimulus that 
is associated with reinforcement and one that is not (or is associated with 
some unpleasant stimulus). 

The way in which animals learn such discriminations appears simple 
enough; with increasing frequency the correct alternative is chosen. If the 
problem is easy the animal learns to make the correct choice in very few 
trials, and if it is difficult the animal may not discriminate at all. Usually 
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the difficulty of the problem is determined either by the closeness of the dis- 
crimination required or by the complexity of the stimuli presented. Despite 
the outward simplicity of the learning process, however, the events that go 
on in the organism must be very complicated. Let us look at some of the 
attempts to deal with this problem theoretically. 

Theories of Discriminative Learning 

Most current theories of discriminative learning fall into two classes. 
One of these places great emphasis upon the gradients of stimulus general- 
ization and upon the gradual accumulation of habit strength to the correct 
stimulus. The other class stresses the importance of the active problem- 
solving character of discrimination and the all-or-nothing characteristic of 
hypotheses that animals might employ in learning to discriminate. 

Unfortunately no single theorist of either class has dealt with all of the 
problems in discrimination learning. Therefore, in stating these theories, 
we must draw on the work of different theorists. By combining their views 
m this way, we probably do an injustice to the individual theorist. At the 
present time, however, there seems to be no more satisfactory way of ob- 
laming a synthesis of these theoretical problems. 

The Algebraic Summation and Continuity Theory 

The theory of discriminative learning that makes greatest use of the con- 
cepts of stimulus generalization and of the gradual accumulation of habit 
strength is primarily the work of Spence (1936, 1937a, 1940, 1952), 
though most of his fundamental notions are close to those of Hull (1943, 

1952). Let us look at some of the principles these theorists have empha- 
sized. 

First of these theorists have insisted that gradients of stimulus gen- 
eralization interact algebraically. Suppose that an animal is trained in a 
iscriminative problem by the reinforcement of response to one stimulus 
and exunction of response to another. The effects of reinforcement will 
generalize to all other related stimuli, and so will the effects of extinction. 
According to this view the strength of the tendency to respond is obtained 
by subtracting the strength of generalization of extinction from the strength 
of generalization of reinforcement for any particular stimulus. Thus the 
following equation expresses the notion: 

Ra = R — I 

Ra is the net strength of response after the inhibition from extinction is sub- 
tracted. 
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This hypothesis is illustrated in Figure 17. Here, the solid curve repre- 
sents the generalization of conditioning R; the dashed curve, the generaliza- 
tion of extinction /; and the dotted one, the net response strength after / 
has been subtracted from R for every stimulus. The exact form of the gen- 
eralization curve, as well as the values of its constants, is, of course, un- 
known, but despite this limitation this idea of algebraic interaction leads to 
important conclusions. 



Figure 17. Illustrates the algebraic-summation notion of discrimination learning. Net response 
strength is the result of the subtraction of the effects of generalized extinction from those of generol- 
Ized reinforcement. (Suggesiod from Hull, 1943.) 

Spences analysis of transposition. One of the most interesting by- 
products of the algebraic summation notion is the way in which it can be 
used to account for a special phenomenon in discriminative learning known 
as transposition. Transposition is the apparent ability of organisms to make 
relational discriminations. We can best illustrate what these are by example. 

Kohler (1915) once trained chickens to respond (with food as rein- 
forcement) to the darker of two gray surfaces. The chickens were never 
reinforced when they responded to the lighter surface. When this discrimi- 
nation was well established, the animals were presented with a new choice 
between the original reinforced gray and one darker still. The interesting 
thing was that the animals chose the darker of these two grays, even though 
they had always been reinforced for choosing the other one. In other words. 
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argued Kohler, the animals had learned the relationship “darker than” 
rather than the association of the original stimulus with reinforcement. 

This result presents a paradox for the theory of learning, for here is a 
case in which reinforcement apparently has no effect (or even a negative 
effect) on choice. It turns out, however, that this example of transposition 

can be explained by the principle of stimulus generalization. Let us turn 
to Spence’s explanation. 

Spence (1937a, 1937b) uses an illustration based on a study of the dis- 
crimination of visual size in chimpanzees. He trained animals to discrimi- 
nate between two scjuares, 256 square centimeters and 160 square centi- 
meters in size. The response to 256 was reinforced and the response to 160 
was extinguished. Then, after the discrimination between stimuli 256 and 
160 was established, the values of the stimuli were shifted to 256 and 409. 
The animals chose 409, a stimulus which had never been reinforced. 

The theoretical analysis of this problem is presented in Figure 18. The 
extinction to stimulus 160 generalized to 256 to some extent; furthermore, 
the conditioning to stimulus 256 generalized somewhat to 409. To arrive 
at the comparative value of response strength after discrimination of 256 
and 409, we subtract the amount of generalized extinction from the amount 
of generalized conditioning. As shown in Figure 18, when the net response 
strength is obtained by this procedure, it is greater for 409 than for 256. 
Thus, we would predict that, on the average, animals would choose to re- 
spond to 409 rather than to 256. This is a prediction of the transposition 
effect. 

The generalization curves in Figure 17 do not yield the transposition 
effect, in this figure the highest net response strength occurs to the stimulus 
originally reinforced. From a comparison of this figure with Figure 18, we 
can infer that the prediction of whether or not transposition will occur de- 
pends upon the shape, relative location, and slope of the generalization 
curves. This latitude offered by the selection of arbitrary values (presum- 
ably to be determined experimentally) for curves of generalization is, theo- 
retically, both an advantage and a weakness.. It is a weakness in that the 
theory can provide no rational ^ way of exactly stating the quantitative as- 
pects of its assumptions; it is an advantage in that it suggests that trans- 
position will occur only sometimes. The gestalt notions from which the 
transposition problem originally comes imply that transposition will always 

® Spence (1942) does present some empirical constants for a particular form of 
generalization, but since we have already seen something of the variability of obtained 
generalization curves, we cannot stretch his curves to fit many cases. 
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^cur.The theory based on generalization gradients suggests that occurrence 
de^nds upon how we affect the generalization gradients by training, etc. 

The obvious way to prevent transposition is to make the positive and 
negative stimuli very different from each other. A demonstration of what 
this would do can be seen by moving the curves in Figure 18 apart. In gen- 
eral, experimenters have found that when they move stimuli far apart there 
IS a failure of transposition (Spence, 1937a; Webb, 1950). Two s’tudies 
(Kendler, 1950; Ehrenfreund. 1952) have systematically explored the 



effects of increasing the difference between stimuli, and the results are what 

would be expected from generalization gradients-the percentage of t 

sponses showing transposition decreases. ^ 

Several experimenters have studied the problem of transposition in chil 
dren (Kuenne, 1946; Hunter, 1952; Alberts and EhrenfreuL, 1951) Ar 
guments have been advanced to the effect that ac rhiiH ' r 
verbal to verbal stages their solution oTSe^a^^^^^^^^ 'T 

from one based upon generalization gradients to one more hkHd^uirfor' 
mation of relationship” concepts (Kuenne 1946 - t 

McConnell, 1955). No critical evidence exists that’ t ' 

teresting hypothesis, and we shaU encounter it later. 
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Despite the considerable merits of the stimulus-generalization derivation 
of transposition, we should not lose sight of the fact that sensory experi- 
ments have conclusively demonstrated our perception of objects to be de- 
termmed by the context in which they are embedded (gray compared with 
black looks different from the same gray compared with white). In this 
sense, even discriminations based on the subtraction of generalization gra- 
dients should be relational; i.e., the generalization gradients themselves will 
be modified by perceptual relationships between the stimuli. FinaUy, rats 
wi transpose more readily when the stimuli are presented simultaneously 
(Baker and Lawrence, 1951). Furthermore, rats will unambiguously make 
re ational choices when they are presented with stimuli that clearly imply 
relationships. For example, relational choices are more likely if the stimuli 
are of two brightness levels rather than one (Lawrence and DeRivera, 1954). 

Other uses of the algebraic summation principle in generalization. The 
basic notion used in the prediction of transposition from generalization 
gradients, namely, the algebraic summation of generalized habit tenden- 
cies has many other applications. It has been generally assumed by those 
or 'ing in the psychology of learning that stimulus generalization is a 
asic component in the ability of organisms to transfer what they have 
rned Lorn one situation to another, similar one. The relationships be- 
tween stimulus generalization and more general problems in the transfer 

^ > cr, an^l we must postpone a dis- 

cussion o some of these. Part of the difficulty, of course, is our inability to 

specify the exact form of generalization gradients. It should be noted that 
some progress is being made; Lawrence (1955), for example, has pointed 
out certain important restrictions that must be incorporated into generaliza- 
tion gradients if they are to account for the existing data on discriminative 

earning. Lawrence suggests that only certain types of concave downward 
gradients are adequate (see Figure 14). 

The Noncontinuity Theory 

The theory of discriminative learning we have just looked at places great 
emp asis upon the gradual accumulation of habit strength and the algebraic 
summation of gradients of generalization based on reinforcement and ex- 
tincUon. In general this theory has been contrasted with one which empha- 
t e problem-solving behavior of organisms in discriminative learning. 

This is generally called a noncontinuity theory, in that it implies that learn- 
ing a discrimination is not a continuous accumulation of positive and nega- 
tive abit strength. The essential idea is that animals try out hypotheses 
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about the discriminative problem they are required to solve. Let us see 
now this might work. 

A rat IS trained to jump to one or another window in the Lashley jump- 
mg stand. The problem for the rat .s to learn to jump to the white window 
and to avotd the black one. When .nitially faced with a choice between the 
ite and black, the white window might by chance be on the ri^ht side 
f the rat jumps successfully to the white window, it might learn either 
hat white ,s correct or that the right side ,s correct. l,et us suppose that 
i learns right to be correct. The rat will then work on this hypothesis until 

hvD th "I'gh' 'hen adopt some other 

asoect ^’'''ingency of reinforcement and some 

aspect of the environment. This would continue until the rat happened on 

e correct hypothesis and kept at it long enough to convince the expert- 
nienter that the rat had solved the problem. 

In other words, instead of gradually learning a single habit through the 
(and7nT“°" ‘he rat would have learned 

This notion-that animals adopt hypotheses in solving probleras-has 
een advanced by many investigators, most notably Krechevsky ( 1932 ) and 
Lashley ( 1938 , 1942 ; Lashley and Wade, 1946 ). The nonconLuity theory 

thanT' by one author, and many more points 

nearly all hts publications emphasizes the momentary attentional set the 

tmal may have m perceiving a portion of the stimulus field set in front 
It oy the experimenter. 

ten7d to7''‘' noncontinuity viewpoint have 

tended to use experimental problems that arise out of intuition and their 

own experience. The continuity theorists, on the other hand, have tended 

o develop experimental hypotheses more rigorously from formal (Id 

nietimes mathematical) principles. In the light of this difference in ao 

proach It IS not always possible to compare the continuity and nonconli 

PumV 7’ experiments designed for this 

purpose and see what the outcome has been. 

Comparison of the Continuity and Noncontinuity Theories 

The ejects of changing the problem. Some of the best evidence on the 
o jarative adequacy of the continuity and noncontinuity theories has 
ome from experiments in which a problem is altered pari way through 
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the training program. This has most frequently taken the form of reversing 
the discriminative cues. Suppose that we start to train a rat to discriminate 
between black and white cards. We reinforce the black card and extinguish 
the white one. Then after a few trials, we reverse the cues, so that the white 
card is now associated with reinforcement and the black card is extin- 
guished. What effect will this have on the animal’s eventual rate of learning 
the problem? If the animal had been following an erroneous hypothesis (a 
position hypothesis, say) during the initial phase of the training, we should 
expect no effect if we reversed the cues, since the animal had not yet begun 
to associate the black and white cues with the reinforcement and its absence. 


If, however, each trial adds a small increment both of habit strength to the 
positive stimulus and of inhibition to the negative stimulus, as the conti- 
nuity theory would suggest, then reversing the cues ought to lead to nega- 
tive transfer or interference. 

Quite a few experiments which use the technique of reversing cues dur- 
ing the presolution period have been reported. Nearly all of these show 
that reversing the cues does retard learning (for example, McCulloch and 
Pratt, 1934; Ehrenfreund, 1948; Ritchie, Ebeling, and Roth, 1950; Grice, 
1951 ). If cues are reversed every few trials, animals eventually come to do 
better on successive reversals. Figure 19 shows that there is interference 
(reversals result in more errors) the first few times a pair of stimuli is 
reversed. After several reversals, however, animals come to do better than 



Figure 19. The number of errors made on successive problem reversals. The number of errors on 
each problem increoses rapidly at first, but after a few reversols, errors are less frequent than on 
the initial problem. (Got/mg, J952.) 
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they would have done with no reversals at all (Gatling, 1952). This is much 
like the learning-set” phenomenon discussed below (also see North, 1950). 

It is clear, then, that the results of discriminative experiments in which 
the cues are reversed favor the continuity view. In addition to these ex- 
periments, there are some in which cues are modified rather than reversed 
during the presolution period and still others in which more than one kind 
of cue is used. Let us look at some of these experiments. 

One that seems to favor the noncontinuity view is an experiment by 

Lashley (1942). Lashley first trained some rats to discriminate between 

large and small circles (the large circle was reinforced). Then Lashley 

continued the training, but this time he substituted a large triangle for the 

large circle. Thus the animals now discriminated between a large triangle 

and a small circle. Yet, these animals failed to make the distinction in shape 

for when they were tested with the large triangle and the large circle they 

showed no preference. Furthermore, they showed a preference for a large 

circle over a small triangle. The results of this experiment suggest that the 

rats did not learn the incidental cue (shape), but only the cue they had 
been originally trained on (size). 


On the other hand, there are experiments which suggest that rats do 
learn such incidental cues. For example, Bitterman and Coate (1950) 
showed that if a spatial cue is made incidental to the learning of a bright- 
ness di.scrimination, the spatial cue is learned also. It is obvious that this 
question is by no means settled. Some experiments that we shall examine 
later will suggest a way of reconciling the findings that sometimes inci- 
dental cues are learned and sometimes they are not. 

Successive versus simultaneous discrimination. Another comparison of 
theoretical importance in discriminative training is that between simultane- 
ously and successively presented cues. (It is not always clear which theory 
IS on which side, however. Compare Grice, 1949, with Spence, 1952.) The 
problem is especially important to the matter of relational versus absolute 
discrimination. If animals respond to relationships between stimuli in 
learning how to discriminate, then successive presentation of stimuli should 
make the problem more difficult than simultaneous presentation. If, how- 
ever, animals respond simply to the positive reinforcement of one stimulus 
and to the extinction of the other, it should not make much difference all 
other things being equal, whether the stimuli are presented together or not 

One of the best experimental tests of this problem is by Grice (1949) 

It IS difficult to compare the learning which occurs to simultaneously pre 
sented stimuli and that which occurs to successively presented stimuli be- 
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cause in the latter there are no errors, since the animals have no choice to 
make. Grice, however, estimated the errors in successively presented stimuli 
by counting as errors all runs to the positive stimulus with latencies greater 
than the median latency and all runs to the negative stimulus with latencies 
less than the median. With this estimate, there was no difference in the 
learning curves for the two conditions. Thus, Grice concluded, it is not 
necessary for animals to compare stimuli in order to learn a discrimination. 

Spence (1952), however, argues from the absolute theory of discrimina- 
tion that under certain conditions simultaneous presentation should be 
easier for animals than successive presentation. He cites data from his 
laboratory and disputes evidence advanced by Weise and Bitterman (1951) 
showing simultaneous presentation to be more difficult. 

The problem is a complicated one. It is clear, as Nissen (1950) points 
out in a paper discussing earlier experimental work, that discrimination is 
to a large extent the establishment of approach tendencies to positive 
stimuli and of avoidance tendencies to negative stimuli. The experiments 
on successive versus simultaneous discrimination, however, show that the 
learning of even a moderately simple discriminative habit is too involved to 
be easily explained by any existing theory. The experiments by Bitterman 
and others are complicated, as Spence has remarked, by the existence of 
patterning between positive and negative stimuli; the extent to which this 
influences ease of discrimination is unknown. One respect in which Grice’s 
experiment is different from the others is that it is not complicated by stimu- 
lus patterning. Thus we should probably accept his experiment as demon- 
strating that absolute tendencies to approach or avoid stimuli can be 
established, though relationships between stimuli may be fundamental to 
real-life animal discrimination. 

An evaluation of the two theories. It should be evident by now that 
experimental data on discriminative learning have far outrun theory. Cer- 
tainly the absolute, continuity theory is correct in asserting that animals 
can be trained to have positive responses to certain stimuli without com- 
paring these stimuli to others on the same sensory continuum. In the process 
of training, even though animals give no evidence of having grasped the 
discrimination, experience at the task has effects which can be detected 
by negative-transfer tests. On the other hand, there is also abundant evi- 
dence that animals do not simply react to any stimuli that come within 
range; they are selective in what they attend to. Furthermore there is good 
evidence that animals test preliminary “hypotheses” about the nature of the 
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problem. This, of course, is reflected m the experimental literature; this is 
why there are no unequivocal data to support the extant theories. From the 
van age point we have at the moment, it appears as though an adequate 
eory wiJ be a synthesis of many current notions. The idea of stimulus 
generalization, or something like it, is important and will probably be re- 
ained as an essential element in theory making. In addition there will 
probably be notions describing more adequately the selective nature of 
attention and hypothesis in discriminative learning. 

THE DEVELOPMENT OF DISCRIMINATIVE CAPACITY 

thatofTe°d""?'“'''""' experimental attention is 

second^ th «f cues in discrimination, and 

second y, there has been an elaborate experimental program to study the 

gradual improvement m discrimination that comes from learnin<. how to 

o-scnminate. Let us first consider the problem of acquiring distinctiveness 

The acquired distinctiveness of cues. Much of .1 1 . 

of discriminative learning has incorporated the assumptiorthat cues are 
always learned to particular responses-that learning is a simple matter of 
imulus-response bonds. Nearly all theorists agree, however that at he i 
this IS an oversimplification, and fecently there have Len 

designed to show that discriminative iLrning involves 

central associations. Some of the best of the! . mediated or 

the acquired distinctiveness of cues. xperiments have been on 

Lawrence and his associates (Lawrence 1940 losn d 

™ce. ,953, have .h., when' ani^ '^,7; ''™‘ 

between cues thev ran ^ discnminate 

which entirely different instrument \ ^ '^crimination to new situations in 

tinction betwLn cues Is noreZr ‘^e dis- 

in discrimination learning and the^ f responses used 

lius. inus, m addition to consider- 

^ Much of the evidence for this comes fmm ^ u 
and Error.” This refers to the fact that aZals called “Vicarious Trial 

at choice points in discriminative learning WoL '" vacillation and indecision 

presented a good review of the effect. ^ dworth and Schlosberg ( 1954 ) have 
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ing the factors of positive and negative stimulus generalization in any com- 
plicated learning problem, we must also consider the extent to which 

particular cues acquire distinctiveness for animals (see Bauer and Lawrence, 
1953)! 

An important outcome of this work has been to point toward the solution 
of some of the issues in the continuity-noncontinuity controversy. Lawrence 
and his associates (Lawrence and Mason, 1955; Goodwin and Lawrence, 
1955) have suggested that in complicated discrimination problems, ani- 
mals can learn several habits simultaneously. First of all they can learn to 
orient toward a particular kind of stimulus (black versus white cards, 
etc.), and secondly they can learn the actual discrimination (white is rein- 
forcing). Suppose, however, there are other cues present, such as the height 
of a hurdle over which the animal must jump. If the experimenter changes 
the problem so that the height of the hurdle is the cue, the animal need not 
necessarily unlearn the black-white discrimination. All that is necessary is 
that the orientation towards brightness be extinguished. If this is so, the 
animal can learn the new problem without disturbing the old discrimination. 
Thus, in complicated discrimination situations it is possible to obtain be- 
havior in which the rat appears to engage in particular hypotheses without 
disturbing other hypotheses. Notice that this derivation of a complex effect 
in discrimination learning is built upon the simple mechanisms of reinforce- 
ment theory. Lawrence has devised many ingenious experiments which 
point to the way in which problem solving and relational discrimination can 
be derived from simple mechanisms. The most important step is the aban- 
donment of the simple stimulus-response paradigm. Once we grant the 
possibility of connections between stimuli and nonovert mediated processes 
in the animal, it becomes possible to derive complicated effects in discrimi- 
nation learning from simple premises. 

Reid (1953) has presented the same notion, namely, that in addition to 
learning a discrimination animals also learn something about the nature of 
discriminative learning, and this can account for the development of non- 
continuity behavior out of continuity-like processes. Reid demonstrated the 
adequacy of this idea by showing that there was more rapid learning of a 
discrimination reversal after a long period of overlearning on the original 
problem. Reid points out that Spence and others have suggested the need 
for a receptor-orienting act, or something like it. In simpler discriminative 
experiments receptor orientation, or learning to notice particular cues, is 
unimportant because only a limited number of cues are presented to the 
animal. When conditions become more complicated, however, this orienting 
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becomes a more important determinant of behavior. It is significant that the 
experimental evidence shows this orienting is learned in much the same 
way that other kinds of discrimination are learned. 

Discrimination-learning sets. The examples of discrimination discussed 
thus far have been those in which animals obviously have had some diffi- 
culty m learning. Yet it is possible to find examples of discrimination in 
animals in which there is practically no learning, because the animal solves 
the problems immediately. In some experiments, for example, two stimuli 
are presented to monkeys; one of the stimuli, say, is a red cylinder and 
the other is a blue pyramid. Under one of these objects is hidden a peanut. 
When these stimuli are exposed, the monkey carefully examines each and 
chooses to look under one. If, by chance, it looks under the wrong one, on 
the very next trial it looks under the other one. The discrimination is per- 
fect, and It IS only a matter of chance whether the monkey discovers the 
peanut on the first or on the second trial. 


Immediate and perfect solution of this and much more difficult problems 
comes from a history of learning to make discriminations of this sort As 
animals solve successive discriminative problems, they show an orderly 
and gradual improvement in this ability. Such an improvement in ability 
Harlow has called the formation of learning sets (Harlow, 1949). The 
demonstration of this gradual improvement is extremely important because 
It suggests a direct continuity between the most primitive examples of dis- 
cnminative learning and the insightful behavior of sophisticated animals. 

Une kind of behavior arises out of the other. 


An analysis of the way in which this happens has just begun. One im- 

Lobir H (Riopelle, 1953)_transfer from problem to 

fill ,1 I of oues associated with success- 

ful solution becomes less and less as additional problems are solved In 

other words, animals learn that they cannot rely on stimulus generalization 

probleml Th ‘'i! ^“^^essive 

earli e thir b """ we discussed 

earlier either becomes completely suppressed or radically altered in char- 
acter. Warren (1954) points out that test-wise monkeys approach each 
recombination of stimuli as a new problem however many timL the specific 
muh required in the discrimination had been differentially rewLded 
Thus even if one of the stimuli were much more frequently reLrded than 

more often. Warren suggests that this means that the analysis of discrimina- 
tive learning according to the excitatory and inhibitory strengths built up 
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to particular stimuli as the result of reinforcement and extinction does not 
apply to animals that have learned many problems. 

Thus as a result of long experience at discrimination, animals learn some- 
thing new. They are no longer limited to the information that red objects 
or square objects are associated with reinforcement; they learn that one 
of two stimulus objects will be reinforcing or that one that was not reinforc- 
ing on the previous trial will be reinforcing on the next trial. Such learning 
is truly the solution of a problem, and we shall reexamine discriminative 
learning sets when we consider the role of learning in problem solving. 

PERCEPTUAL LEARNING 

The influence of learning on perception. Almost as old as philosophical 
inquiry itself is the question, Is man’s ability to perceive objects of the 
world innate or is it the result of learning? In modern psychology this ques- 
tion had long been dormant because nobody seemed to be able to suggest 
a reasonable or even testable solution to the question. In recent years, 
largely because it was raised in a new context (Hebb, 1949), there has been 
a revival of interest in the matter. Specifically, the question has been raised 
as to whether the normal visual and tactual perception of shape and pattern 
depends upon learning or whether it is an inherent property of the nervous 
system. 

In the first experiments aimed at exploring this question, apes were 
reared in darkness from birth to the time of testing. These apes, deprived 
of all visual experience, were tested on visual discrimination problems. The 
results showed an astounding deficiency in visual form perception; these 
findings were not conclusive, however, because there was evidence of a 
deterioration in the physical capacity of the eyes because of the long period 
of disuse (Riesen, 1950). 

Several subsequent experiments have confirmed the earlier observations, 
and in these the possibility of deterioration of the sense organs was fairly 
well ruled out. In one (Siegal, 1953), doves were reared with translucent 
head covers. These covers permitted light to come through (thus not de- 
priving the retina of stimulation) but did not allow definition of visual form. 
Birds reared with these covers took much longer to learn simple form 
discriminations than did normal birds. It seems certain that experience was 
a normal prerequisite for visual perception in these birds, though there is 
some question as to whether a lack of visual-motor coordination was not 
the basic factor. 
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In another experiment (Nissen, Chow, and Semmes, 1951 ) a chimpanzee 
was reared deprived of tactual, kinesthetic, and manipulative experience. 
The results were much the same as those found in the experiments with 
the ring doves. This animal was grossly inferior to normal ones in the abil- 
ity to learn tactual discriminations. Again it is not clear whether the depriva- 
tion in experience resulted in a purely sensory deficiency or whether the 
deficiency was one in sensory-motor coordination. 

These experiments point to the great importance of experience in the 
performance of even the most simple perceptual-motor acts. The deficiencies 
that result from deprivation are not the result of failure to learn how to 
discriminate generally, since the animal is free to perform discriminations 
in Its other senses. The deficiency is specific to the sense deprived. If it turns 
out, as Hebb believes, that the deficiency is a result of the failure of the 
nervous system to learn the nature of forms in the perceptual world, then 
the role of infantile learning is enormously more important than anyone 
has ever thought. There is strong incidental evidence to support Hebb’s 

views, since it is difficult to understand how the perceptual constancies 
could arise unless perceptual relations are learned. 


Response versus Place Learning 

Experiments on infantile sensory deprivation and perception suggest that 
learning of perceptual relationships can occur without the intervention of 
motor acts. Other sources supply good evidence for this. As we have seen 
some of the experiments on complex discriminative learning suggest that 
It IS necessary to assume that some discriminations are learned wiffiout be- 
ing attached directly to motor responses. In the preceding chapter it was 
also pointed out that instrumental conditioned responses could be extin- 
guished without the performance of the originally learned response Another 
problem arising from this question of pure perceptual learning concerns 
he nature of what animals, particularly rats, learn in mazes. A pure stimu- 
lus-response theory would say that rats learn chains of discriminations 
attached to chains of responses. Thus, at a particular signal, the rat learns 

to urn right. According to this view, rats would never really learn locations 
only responses. ouuns, 

A considerable literature exists, however, purporting to show that rats 

kamT" T" responses; they do indeed appear to 

learn locations. In one experiment (Tolman, Ritchie, and Kalish 1946) 

tt p“f th — d 

pattern of the maze, new pathways were supplied and the old, direct 
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route to the goal was blocked. Most of the animals chose new pathways, 
which suggested they had learned the general spatial location of the goal’ 
In another experiment, Galanter and Shaw ( 1954) showed that place learn- 
ing is demonstrated in rats if there are many environmental cues, while 
an earlier study (Thompson and Thompson, 1949) suggested that place 
learning is absent if spatial cues are minimized. 

It is clear from these and other studies that rats can learn “places” as 
well as chains of stimulus-response associations. It is probable that the 
highly organized perceptual learning implied in the learning of “place” 
relies on the gradual development of more primitive discriminative learning. 
It probably depends upon the growth of perception implied in the experi- 
ments on early sensory deprivation. It also probably hinges on the de- 
velopment of “learning sets” out of multiple experiences with simpler 
stimulus-response discriminative problems. Thus the learning of “place” 
would appear to be a higher-order example of perceptual learning. 

Tolman (1932, 1951) has long insisted upon the importance of perceptual 
learning in the behavior of animals and men. Despite Tolman’s belief, there 
has been relatively little analytical work directed toward finding out about 
the characterisfics of this kind of learning. Perhaps an extension of the work 
on acquired distinctiveness of cues and on learning sets will help us to 
understand perceptual learning more adequately. 


SOME CONCLUDING REMARKS 

Discriminative learning is an important and fascinating business, and 
there are many problems we have not discussed. For one thing we have 
not said much about discriminative learning in adult human beings. Many 
of the experiments on animals have been duplicated at the human level. 
For example Loess and Duncan (1952) showed that sn cr.ecsivp presenta - 
tifiD-SL§ihuuli g enera ted poorer learning than simultaneous presentation, if 
the discrimination was difficult. Another investiga'tor' (Walk, 1952) showed 
that reversal of a discriminative problem resulted in a decrement in human 
just as in animal subjects. Despite the fact that the outcomes of these 
experiments are similar to those conducted with animals, we suspect that 
discrimination in human beings is much modified by verbal and symbolic 
processes. In a later chapter on problem solving we shall return to some of 
these questions concerning human discriminative learning. 

More important is the demonstration of gradients of stimulus generaliza- 
tion in voluntary human behavior. Stimulus generalization is basic to the 
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ability of organisms to transfer what they have learned from one situation to 
another. The relationships between stimulus generalization and even simple 
transfer are quite complicated, however, and we must postpone a discussion 
of these relationships until other important factors in human learning have 
been examined. Suffice it to say at present that stimulus generalization of 
verbal responses to symbols and stimuli has been well established in human 
learning (Yum, 1931; Gibson, 1941) and in motor behavior (Brown, Bilo- 
deau, and Baron, 195 1 ) . These demonstrations are important, for they enable 

us to make confident use of the notion of stimulus generalization in human 
learning. 

Unfortunately, there are many other questions about discriminative learn- 
ing that we shall have to ignore. One of the most interesting concerns the 
relation of capacity for discriminative learning to the natural behavior of 
animals and their mechanisms of survival. If this were a book about animal 
behavior rather than a book about the processes of learning, we should 
study this question in detail. Let us, for now, take an example of the im- 
portance of discrimination in natural animal life as an indication of the 
role of these factors in survival. 

There is a variety of caterpillar that survives because of the keen discrim- 
inative learning of its natural enemy, birds. This caterpillar, the larval stage 
of the Cinnabar Moth, is brilliantly striped in black and yellow. Every young 
bird tries one of the caterpillars, and rejects it because, apparently, it is 
highly distasteful (Windecker, cited in Tinbergen, 1953). Thereafter the 
birds leave this caterpillar alone. Thus, the caterpillar sacrifices some of 
its kind in order to instruct birds in a discrimination highly useful to itself. 
Certain species of insects, moreover, mimic the appearance of distasteful 
insects, and therefore rely, to the extent they resemble the creatures they 
mimic, on stimulus generalization (Tinbergen, 1953). 

As you can see, there is a vast area in natural history in which the notions 
derived from laboratory experiments on discriminative learning might well 
be put to use. There is, indeed, a recent movement to put to use in wildlife 
preservation ahd control some of the principles obtained from the laboratory 
study of animal behavior (Smith and Geis, 1956). 


CHAPTER 5 


MOTIVATION AND LEARNING 


For a long time psychologists have made a distinction between the things 
we earn, our habits, and the things which prompt us to use these habits, 
our motives. Motives provide the internal impetus behind behavior (needs 
or drives) and the direction the behavior takes (goals); learning provides 

e means by which the organism finds a way to reduce or eliminate need 
by coming into contact with an appropriate goal. 

Motivation is a construct, not something directly observable in animal 
e avior. In other words it is an idea we invent to describe the impetus that 
makes animals active. Presumably it would be possible to invent a satisfac- 
tory theory of behavior that never mentioned the word motive or another 
word like It, but no such theory exists at the present time, and, indeed, 
psychologists have used motivational constructs very freely. Nearly all 
psychologists agree that it is necessary to make use of a construct that 
emphasizes the energy behind behavior. In this chapter we shall discuss the 
relationships between the motivational constructs and learning. In doing so, 

we shall view mo^tion.as the encrgi?if,r of behavior and learning, as -it& 
guide. 


MOTIVATION AND PERFORMANCE 

Motivation has at least two fundamental components, the need state and 
the goal, or external stimulus, that tends to eliminate or reduce that need. 
That IS to say, for nearly every need there is some goal that wUl satisfy or 
reduce the need. Needs may be cyclic, or periodic, in character, in which 
case they usually depend upon changes in the internal, physiological balance 
of the organism. Hunger and thirst are such cyclic needs. Not all needs are 
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governed by internal changes, however. Some exist at a more or less con- 
stant level, awaiting only the occurrence of the appropriate external stimulus 
to set off the appropriate behavior. The avoidance of pain is such a need, 
and in ail animals there are many other such needs, needs we ordinarily 
characterize as instincts. 

When a need is aroused, either by internal changes or an appropriate 
external stimulus, the result is usually that the animal performs some be- 
havior designed to reduce or eliminate the need. Thus, when an animal 
finds its leg in the jaws of a trap, it struggles to free itself; or w'hen it is 
thirsty, it finds its way to a familiar watering place. A good many, though 
not all, of these patterns of behavior are learned. Furthermore, animals 
learn to anticipate these need states before they actually occur, and to act 
accordingly. Thus, animals learn to avoid traps, and they sometimes find 
their way to food or water when they are not hungry or thirsty, or at any 
rate they seem to learn to stay in the vicinity of water and food supplies. 

Activity and motivation. The classical theory of needs says that they 
arouse animals to activity. As a matter of fact, there are some well-estab- 
lished relationships between needs such as general hunger and activity. 
Activity increases directly as a function of the number of hours that rats 
have been deprived of food (Siegal and Steinberg, 1949). There is some 
point, when the deprivation becomes severe enough, at which activity drops 
off (Skinner, 1938). Rats, when allowed to exist upon a free feeding sched- 
ule, will show regular cycles of activity and eating. A period of quiescence 
is follow'ed by some activity which culminates in eating (Richter, 1927). 
Eating itself is followed by a period of subnormal activity (Finger, 1949). 

It has generally been assumed that all of these effects associated with the 
arousal of need for food are direct, unlearned effects of the physiological 
consequences of deprivation. Recently, however, several investigators have 
suggested that the increased activity and other things associated with hunger 
are partly the result of learning. The cycle of quiescence-activity-eating- 
quiescence, for example, docs not occur in rats when their environment is 
carefully held constant (Baker, 1954). Furthermore, activity can be in- 
duced by sudden changes in environment (Campbell and Sheffield, 1953) 
and there is strong evidence that such activity is, in part, the result of a 
discrimination based upon the cues associated with eating (Sheffield and 
Campbell, 1954). Some of the ordinary activity-arousal effects of such 
needs as hunger are partially the result of learning and experience. This is 

one demonstration of the fact that the behavior patterns aroused by needs 
are modified by learning. 


the psychology of learning 

It is well known that other conditions of need, such as sex, maternity, 
lowered temperatures, have specific and general influences upon activity. 
That is to say, there are many different organic conditions that produce 
changes in the general activity level and that induce animals to perform 
both unlearned and learned acts specific to each organic condition. Most 
of these specific effects are not of critical importance to the psychology of 
learning, however, and we shall largely ignore them. 



Figure 20. Rate of responding under partial reinforcement as a function of amount of food just 
eaten. The lower curves are based upon averages of data from four rats each. The upper curve is 
an average of the lower corves. (Skinner, 193B.) 


Performance of learned acts and needs. As we might expect, unmo- 
tivated animals in almost any kind of learned task will make many more 
mistakes and in general perform less adequately than motivated ones. Even 
after rats have been thoroughly trained in a T maze, they will choose the 
alley that leads to food only about 50 per cent of the time if they are not 
hungry (Strange, 1950). Similarly, the rate at which rats in a Skinner box 
press the lever depends upon their state of hunger at the moment. Figure 20 
(Skinner, 1938) illustrates this point. It shows the rate of responding after 
the rat has eaten various amounts of food. The more food the rat eats be- 
fore the experiment, the lower its rate of response. This general effect is also 
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true whether or not the rat is reinforced. Figure 21 shows the rate of 

responding during extinction. We can see that the larger the amount of 

food eaten Ijy the animals before extinction, the lower the initial rate of re- 
spending during extinction. 

This increase in the level of goal-oriented behavior with an increase in 
the basic need seems to be generally characteristic of needs. It is true of 
thirst as well as hunger, and probably other needs as well. Thus, it appears 
that a need state and a reinforcement are both necessary for the emission 
of a learned instrumental response. We shall consider the implications of 
this for the theory of behavior shortly. 



Figure 21. Examples of extinction curves for the tever-pressing habit in rats. The number shows 
how much food (in grams) was fed to each rat prior to extinction. Notice that the initial rate during 
extinction is much higher after no food than after 6 grams of food. (Skinner, 1938.) 


Before we consider other topics, it is weU to settle one important question 
about the influence of need upon learned behavior. This concerns discrim- 
ination. We know that increasing the level of need increases the tendency 
to perform an appropriate learned act. But does it also increase the sharp- 
ness of a learned discrimination? Suppose we have trained an animal to 
perform some discriminative act in which reinforcement (food) is given 
for pressing a lever in the presence of a light and not pressing in the ab- 
sence of the light. If the level of need is then increased by starving the ani- 
mal for a period of time, we find that the rate of responding in the presence 
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of the light is greater and that responding in the absence of light is also 
greater. Thus it appears that while increased need raised the absolute level 
of responding it did not improve the discrimination; i.e., it did not change 
the ratio of responding to the reinforced stimulus, over responding to the 
nonreinforced stimulus (Dinsmoor, 1952). Thus, heightened need does not 
appear to increase the sharpness of a discrimination, though it will increase 
the absolute tendency to perform the appropriate response. 

Performance of learned acts and irrelevant needs. We have just seen 
that increasing the level of need results in greater general activity and in a 
greater tendency to perform acts which have in the past satisfied these 
needs. The question arises as to whether needs irrelevant to a particular 
pattern of behavior also tend to increase the tendency to perform that be- 
havior. Thus, if we have trained a rat to press a lever in order to receive 
food, will making the rat thirsty increase the tendency to press the lever? 
Hull (1943), in his first important book on behavior theory, predicted, from 
his theory, that irrelevant needs would so affect behavior. The experimental 
literature does not clearly support Hull in this, however, and it will be 
worthwhile to see exactly how this is so. 

Some experiments show that after an animal has been trained under one 
kind of need, increasing the level of a different kind of need increases re- 
sistance to extinction (Webb, 1949; Brandauer, 1953). This has generally 
been interpreted to mean that the level of an irrelevant drive does affect 
learned acts specific to another drive. There is, however, a serious flaw in 
these experiments. The animals in these experiments were trained when 
they were hungry. Before extinction they were made thirsty by deprivation 
of water, but were given complete access to food. We know, however, that 
rats will not eat much food when they are thirsty, and so, during extinction 
the animals in these experiments were probably slightly hungry as well as 
thirsty. This interaction between hunger and thirst has been very carefully 
studied, and we know that it can account for the effects in studies of ir- 
relevant needs such as those above (Verplanck and Hayes, 1953). 

In order to investigate this problem adequately, it is necessary to use 
drives that do not interact upon one another. Strange (1954), taking ad- 
vantage of the fact that albino rats show a strong aversion to moderate 
light, trained a number of them to press a lever in order to turn off a light 
for a period of time. He then studied the effects of different levels of hunger 
and thirst upon the extinction of this light-aversion habit. The results clearly 
showed that changing the level of hunger or thirst had no effect at all upon 
the resistance to extinction of the light-aversion habit. Thus, it looks as 
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though when possible interactions between needs are controlled, there is 
little or no effect of irrelevant need upon habit. 

Other kinds of motives. We must not assume that all of the motives of 
animals are based upon conditions of organic need. As we shall see later, 
there is some indication of an important class of learned motives. Further- 
more, it now appears that there are many basic, unlearned motives that are 
not directly tied to some condition of oiganic need. After a long period in 
which theorists tried to reduce the motives of animals to a small list of 
organic need, several investigators simultaneously discovered, in different 
animals, that such things as visual exploration and manipulation of the en- 
vironment are strong motives. 

For example, Montgomery (1951, 1953), has shown that visual explora- 
tion is a strong motive in the white rat, which accounts for many of the 
peculiar features of maze behavior in rats. A series of studies on the rhesus 
monkey has clearly demonstrated that these animals are strongly motivated 
by curiosity and the desire to manipulate interesting objects (Harlow, 
1950a; Butler, 1953; Harlow and McClearn, 1954). Not only are such 
motives more powerful and pervasive than anyone had thought, but it 
appears that they can be extremely resistant to the effects of satiation 
(Montgomery, 1954; Butler and Harlow, 1954). Thus they are probably 
very effective in producing a variety of reinforcements in learning. While 
It is convenient to use food or water as reinforcement in learning experi- 
ments, it is not necessary. As a matter of fact, simple curiosity about stimuli 
IS sufficient motive to provide intrinsic reinforcement in the learning and 

performance of a visual discrimination in rats (Thompson and Solomon 
1954). 

Motivation and learning jointly determine performance. The factors 
that influence behavior are classified by contemporary psychologists into 
two classes, associative and nonassociative. Both factors are responsible for 
the performance of any particular instrumental act. If an animal performs a 
particular act, it is highly probable that this is because ( 1 ) it has learned 
to do so and (2) it is motivated to do so. If motivation directed toward 
the particular goal or incentive associated with the act is low, the probabU- 
ity of occurrence of the act will be low. Likewise if habit strength for this 
particular act is low, the probability of occurrence of the act will be low, 
even though the appropriate motivation may be present. 

Hull (1943) made this joint determination of response strength by mo- 
tivation and habit a keystone of his systematic account of behavior, and 
many other theorists and experimental investigators have explored the im- 
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plications of the relationship. Incidentally, Hull formally specified the rela- 
tionship as a multiplicative one — the effects of motivation are multiplied by 
the effects of learning in order to achieve the appropriate level of strength 
of response. In a multiplicative relationship, if either factor is absent, the 
product is zero, so there will be no response strength if either motivation or 
habit strength is zero. We cannot take this too literally, for we do not know 
what IS meant by “zero” motivation, or even by “zero” habit strength. 

Hull has shown how the joint determination of response strength by 
motivation and habit works by summarizing some data from other experi- 



Figure 22. Resistance to extinction os a joint function of number of reinforcements and hours of 
deprivation. The actual obtained data are represented by the black points. The curves are derived 
from an ecfttation based upon the theory of C. L. Hull. (Penn, 1942.) 


menters (Perm, 1942; Williams, 1938). These investigators studied the 
influence of number of reinforcements (amount of training) and number 
of hours of food deprivation (hunger) upon resistance to extinction of a 
lever-pressing habit. The results can be seen in Figure 22. This figure shows 
something of the relationship between habit and motivation, although it 
seems likely that the exact nature of the relationship would be modified for 
other habits and other motives. 

It is important to realize that this curve shows nothing of the way in 
which motivation might modify learning. It only shows how motivation and 
habit jointly determine the strength (measured by resistance to extinction) 
of a particular habit. The effect of motivation upon the rate of learning it- 
self is a fundamental problem, however, and we shall deal with it next. 
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THE EFFECT OF MOTIVATION UPON LEARNING 

In Chapter 2, we saw that one of the major conclusions to come out of 
the study of reinforcing stimuli is that factors relating to reinforcement — 
such as amount, quality, presence, or absence of reinforcement — do not 
intrinsically affect learning but, rather, performance. That is to say, rein- 
forcement seems to be critical only in getting an animal to perform some act. 
The latent-learning experiments illustrated that with a little ingenuity it is 
possible to get animals to learn without specific reinforcements being asso-', 
ciated with the act to be learned. Since motivation is simply a way of char*' 
ac terizing the in ternal state which leads an organism to respond positively 
to a particular reinforcement, it would seem likely that motivation would 
have an effect only on performance. Within limits this would seem to be 
true. Let us examine the experimental evidence on the question. 

Latent-learning experiments, In Chapter 2 we discussed one kind of 
latent-learning experiment in which hungry rats were allowed to explore a 
maze before the introduction of a reward. In general, the results of several 
different experiments of this type showed, as was just mentioned, that learn- 
ing was independent of the kind of specific reinforcement employed. Rats 
allowed to explore a maze without food reward will, under the proper condi- 
tions, show that they have learned the maze as well as rats rewarded for 
every trial. 

The kind of latent-learning experiment important to the question of mo- 
tivation and learning is a little different. The animals, instead of being 
motivated but not rewarded, are rewarded but unmotivated. For example, 
satiated rats may be allowed to explore a maze which contains a reward’ 
If the reward is food, the animals are weU fed before they are allowed in 
the maze. The result is that they do not seem to prefer to choose the path 
endmg in reward; if they do find the food, they are disinterested. Since we 
wish to know if the rats have learned the location of food despite their in- 
difference toward it, we make the rats hungry and then put them back in 
the maze. If they run the maze as weU as animals receiving an equal number 
of remforced trials when they are hungry, we should conclude that motiva- 
tion IS not necessary to learning, but only to performance 

The experiments that have been reported are somewhat more compli- 
cated than the one outlined above. Usually they have been elaborated by 
the addition of a second need state, so that animals are permitted to explore 
a maze, not just satiated with food, but thirsty. If water is present at some 
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point in the maze, animals readily learn to find their way to the water, but, 
of course, the critical problem is whether or not they will also learn the 
location of food. This was essentially the situation in a well-known experi- 
ment by Spence and Lippitt (1946). It turns out, incidentally, that Spence 
and Lippitt’s rats did not apparently learn the location of food when they 
were thirsty and reinforced by water. As can be seen, however, the same 
criticism based on the lack of independence of thirst and hunger — applies 

to this experiment as applied to the studies of performance and irrelevant 
need. 

Perhaps because these latent-learning experiments have been unduly 

complicated, the results of different experiments have been contradictory 

and confusing. Many of these experiments are critically reviewed in a paper 

by Thistlethwaite (1951), and it is unnecessary for us to discuss them in 

detail. Unfortunately, however, the basic issue has been lost in many of 

these studies. It has been pointed out (for example, Deese, 1950) that the 

fundamental issue is whether some aspect of the learning function (and 

this can be very exactly stated as a mathematical parameter) is or is not 

a function of the level of motivation. Adding several different drives and 

rewards to the latent-learning experiments merely complicates this issue. 

Because so many of the latent-learning studies are thus complicated, it will 

be well for us to turn to another kind of experiment in order to settle this 
issue. 

Experiments comparing effects of level of motivation. In the past few 
years, a large number of experiments have been designed to assess the influ- 
ence of motivation upon learning. Most of these experiments have used 
resistance to extinction as the measure of strength of response. The experi- 
ments are usually designed so that various groups of animals are trained 
to press a lever in the Skinner box under various levels of motivation and 
then all are extinguished under a common level of motivation. If different 
levels of motivation produced different effects on amount of learning, re- 
sistance to extinction in the various groups should be different, even though 
all animals are extinguished under the same motivation, because resistance 
to extinction should reflect the effects of conditions of learning. Nearly all 
of the experiments of this sort show negative results; different levels of 
drive during learning have little or no effect upon resistance to extinction 
(Kendler, 1945; Reynolds, 1949; Strassburger, 1950). One study of this 
type showed slightly different results (Finan, 1940). In this study some 
animals were trained after only one hour of food deprivation, and others 
were trained after 12, 24, and 48 hours. All animals were extinguished 
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after 24 hours. There was slightly less resistance to extinction in the groups 
trained after 1 and 48 hours of food deprivation than in the 12- and 24- 
hour groups. While the differences found in this experiment are small, they 
are interesting because they suggest that the stimuli arising from various lev- 
els of hunger show stimulus generalization in much the same way as exter- 
nal stimuli; since the animals were all trained under moderate hunger, the 
center of the generalization gradient was at moderate levels. 

One experiment (Carper, 1953) that has attempted to assess both the 
direct influence of motivation upon performance and learning in the Skinner 
box shows very clearly that resistance to extinction is very- sensitive to 
motivating conditions prevailing during extinction but highly impervious to 
any differences in motivating conditions prevailing during training. 

Experiments on other kinds of instrumental responses have shown more 
complicated results. For one thing, it seems clear that there is a residual 
effect from high motivation if animals are switched to low motivation 
(O’Kelly and Heyer, 1948; Deese and Carpenter, 1951). That is to say, 
animals trained under high motivation and then switched to low motivation 
will perform better than those always tested under low motivation. As in 
the studies of resistance to extinction in the Skinner box, however, animals 
trained under low motivation and then switched to high motivation do as 
well as those tested under high motivation all along, provided the habit is 
a simple one (Deese and Carpenter, 1951). Surprisingly enough, even in 
so complicated a habit as that required in learning a 10-unit maze, animals 
trained under low motivation do as well as animals trained under high mo- 
tivation when tested under high motivation (Hillman, Hunter, and Kimble, 
1953). 

This is surprising because, as we have mentioned, the independence of 
learning from kind and quality of reinforcement held only for simple habits. 
As Seward (1951) has pointed out, a reinforcer does not strengthen a habit 
so much as it provides an incentive for its additional use. In complex habits 
the problem is complicated by the fact that a poorer quality of reinforce- 
ment provides incentive for only indifferent performance. Since animals 
learn only what they do or experience, if there is little or no incentive to be 
active, animals will not learn much. The same argument, of course, applies 
to need or motivation; if an animal is to learn, it must be motivated at least 
enough to be goaded into activity. 

This does not mean that motivation and reinforcement intrinsically affect 
the rate of learning, but merely that if these things are missing animals will 
not be induced to perform complicated acts. In the case of the experiment 
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on maze learning outlined above, curiosity probably helped the unmotivated 

rats to explore the maze even in the absence of food or hunger. This motiva- 

lon was weaker than hunger, consequently their performance in the maze 

was poorer than that of the motivated rats. It was, however, good enough 

to permit the unmotivated rats to learn the correct choices as well as did the 
motivated rats. 


Thus we may conclude that for relatively simple habits, learning under 
low motivation is about the same as learning under high motivation. Leam- 
.ng of complex habits is modified by the influence of level of mbtivation on 
per ormance. Therefore it looks as though the associative mechanisms of 
earning are largely Independent of the motivation-reinforcement factors 
hat control behavior. These associative mechanisms are only incidentally 
brought to play in most learning situations because the combination of mo- 
tivation and the reinforcement speciflc to that motivation induces the or- 
ganism to act. Campbell and Kraeling (1954) point out that in many situa- 
tions animals may learn responses that are qualitatively different when they 
leam under different levels of drive. 


SECONDARY MOTIVATION AND LEARNED INCENTIVES 

For a long time many psychologists took the view that there were but a 
handful of unlearned motives, most of which, if not all, were biological 
needs. If one takes this view it is difficult to describe all the elaborate mo- 
tives of adult human behavior unless great emphasis is placed upon the 
eve opment of learned motives. All of the complicated motives not directly 
and obviously connected with biological motives have been considered to 
be derived from biological needs by a process of learning. 

Although today most psychologists grant an important role to learned 
motives (or at least to learned incentives), the prevailing opinion seems to 
e that the view stated above is too simple. We are less sure today that all 
unlearned motives are exhausted by a simple catalogue of biological needs. 
As we examme the behavior of animals and men in nature, it is difficult to 
untangle motives that seem to be learned from those that are not. It is pos- 
sible that the actions of all motives, even those directly rooted in biological 
needs, are modified by learning and that there are some important motives 
that are simply the result of associative learning. Even though we are no 
longer convmced that all secondary motives are entirely learned, learning 
does play an important role in the elaboration of motives, and it will be 
worthwhile to examine some of the mechanisms of learning in motivation. 
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Secondary Drives 

We saw in the chapter on reinforcement that animals sometimes come to 
behave toward a hitherto neutral stimulus situation as if it had incentive 
value. Thus, if a chimpanzee, as the result of taking part in an experiment, 
comes to behave as if it were fascinated with poker chips, or if a rat comes 
to behave as if it were mortally afraid of a flashing light, very likely we shall 
say tliat the poker chips and the flashing light have acquired secondary re- 
inforcement value. Sometimes it has been assumed that the development of 
the incentive, or reinforcement, value for The neutral stimulus has been ac- 
companied by the development of an acquired drive. There are some diffi- 
culties with this notion, as we shall see later. For the moment, however, let 
us examine some of these experiments on acquired or token rewards. 

Token-reward studies. An interesting variation in the secondary-rein- 
forcement studies is found in the studies of token rewards. The general idea 
in such studies is to choose some primary drive, such as hunger, and some 
token, such as a poker chip, that can be used by an animal to “purchase” 
food. It is not too difficult to train chimpanzees to learn to use such tokens 
in food-vending machines as well as to discriminate among the various col- 
ored chips according to value (Wolfe, 1936). Furthermore, the chimpan- 
zees can be taught new habits in order to obtain the chips. 

It is possible to get chimpanzees to accumulate such tokens for a while 
without being paid off in food (Cowles, 1937). Therefore it would appear 
that one could characterize the tendency of the apes to work for and ac- 
cumulate poker chips as a learned drive or motive. Several theorists, how- 
ever, have shown that this is a gratuitous notion (Brown, 1953; Seward, 
1953). Brown points out that the pressure to assume a learned drive for 
poker chips would have been much greater if the apes had been either will- 
mg to work for the chips when they were not hungry or more willing to work 
for the chips when they could not be exchanged for food. Thus he implies 
that the chimpanzee’s token-reward habit is based upon secondary rein- 
forcement of a hunger motive, not upon a new drive for poker chips. 

There are few studies in the experimental literature that suggest that ani- 
mals do develop new drives by association with the reinforcement of an 
old one. The results of some studies are hard to account for without the 
assumption of learned drives; perhaps the best known of these experiments 
are those of Anderson (1941a; 1941b) on what he called “externalization 
of drive.” In these, rats were required to learn a completely new maze solely 
on the basis of secondary reinforcement. These rats would learn the new 
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maze whether or not they were reinforced with food, or (and this is the 
important point) whether or not they were either deprived of food or sati- 
ated. The only requirement for learning to take place was that the new maze 
contain cues which had been associated with food in the past. Anderson 
character^ed this kind of learning as “extemalization of drive” because it 

showed that a motive had become established to an external object rather 
than an internal state of need. 

Thus It seems possible that rats (and perhaps men) can acquire motives 
for previously indifferent goal objects. There are, however, very few ex- 

therefore there are a lot of problems un- 
solved^ What would happen, for example, if some new cues were associated 
with the reinforcement not of one drive, but of several drives? Would the 
res ting earned” drive be more stable, less resistant to extinction, etc.? 
Would a partial reinforcement schedule during training make any difference 
m the strength of the learned drive? These are important questions, and 
the evidence we now have on secondary drive is not very adequate.^ If we 
are able to make use of this concept in explaining more complicated proo- 
lems in human behavior, our extrapolations would make more sense and 
perhaps be easier to assess if we had better experimental evidence in lower 
animus. It is still possible, for example, that Anderson’s “extemalization of 

rive effect represents merely the arousal of a native curiosity drive. These 
are things that should be examined more thoroughly. 

^ear as a drive. We shall have to consider in detail in the next chapter 
the question of fear as a learned response. Ordinarily fear is considered to 

an emotion, but many psychologists have pointed out that it can have 
motivaung properties. It is responsible for variations in the strength of cer- 
tain mstrumental acts, and reduction or elimination of it is apparently rein- 
orcmg. Since fear is frequently learned (though not always so, contrary to 
an impression one would get from reading certain parts of the current psycho- 
ogicaJ literature), it is an important example of a learned drive. Unlike 
secondary or learned motives built upon hunger, thirst, or some other sim- 
ilar organic condition, learned fears are relatively easy to establish and j 
examine in the laboratory. 

Miller (1948, 1951) presents us with the classical example of fear as a ( 
earned drive. A rat is introduced into an apparatus like that illustrated in ^ 
igure 23. There are two compartments in the apparatus, a white one with I, 

an electrically wired grid as the floor and a black one with a smooth solid ( 


results of ^ answering these questions 

results of their experiments show that the answers obtained will not be sLplc. 
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floor. There is a door between the two compartments; the rat can open this 
door either by pressing a lever to the side of the white compartment or by 
turning a wheel at the end of the white compartment. First of all the rat is 
shocked in the white compartment and allowed to escape into the black 
compartment. This teaches the rat that the w'hite compartment is to be 
feared and that the black compartment is safe. In order to test for a learned 
drive, however, it is necessary to find out whether the rat will learn a new 



Figure 23. Apparatus for demonstration of fear as a learned drive. One compartment is vrhite 
ond one is black. The door between the compartments can be opened either by pressing a lever or 
by turning a wheel. The rat learns to escape one or the other compartment by the association of 
electric shock-produced pain and the stimuli from that compartment. (M'llUr, 1948.) 

response to escape from the white compartment without further electric 
shocks. Miller showed that rats could learn to turn the wheel, with escape 
into the black compartment as reinforcement, much as any other instru- 
mental response is learned. Figure 24 shows that each time the rat is placed 
in the white compartment it takes less time for the rat to get around to 
turning the wheel. 

Thus it seems clear that acquired fear can serve as a motivation for in- 
strumental activity and that reduction or elimination of this fear can serve 
as reinforcement. The importance of acquired fear in emotion and conflict 
is dealt with in the next chapter. 
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r '“'•n ,he Wheel with esecp. front the .hock eontportment a. o motive. The 

rot ,s shocked m the white compartment, from which it con escape by an open door. On subsequent 
nonshock trials, the rat must learn to turn the wheel, which opens the door, in order to escape from 
he compartment in which it had been shocked. (MilUr, 194 B.) 


Theories of Secondary Drives 

Acquired fear, as outlined in the experiment presented above, is fairly 
easy to produce. Indeed, it is sufficiently simple and reliable that it may be 
used in a laboratory or demonstration in elementary psychology. Learned 
drives based upon reward are very difficult to demonstrate, however, and, 
indeed, Myers and Miller (1954) present strong evidence for the existence 
of artifacts in the experiments purporting to demonstrate them. They sug- 
gest that these experiments (like Anderson’s, mentioned above) are proba- 
bly influenced by exploratory drives or curiosity. 

Thus there seems to be a real difference in the ease with which secondary 

appetitive drives and secondary aversive drives are produced. In fact, there 

is not yet an unequivocal demonstration of the development of a secondary 

appetitive drive in the literature. Several reasons have been advanced for 

this difference, and it has become a matter of importance in theories of be- 
havior. 

Miller (1951) argues that it may have to do with the relative abruptness 
of onset of the primary drive. Aversive drives can be readily aroused by 
noxious stimuli (such as electric shock), so it is relatively easy to pair a 
particular stimulus condition with the arousal (and reduction) of the drive. 
Appetitive drives develop slowly; it takes time to become hungry. There- 
fore hunger, as a drive, cannot be neatly paired with a particular stimulus 
condition. Furthermore its reduction (by eating) is probably a more grad- 
ual and complicated affair. 
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Seward (1953) advances an argument also based on a distinction be- 
tween primary drives that have sudden and those that have gradual onsets. 
He suggests that drives with sudden onsets may be conditioned directly to 
neutral stimuli (and thus give rise to fear in the case of pain as a primary 
drive). Drives with gradual onsets, however, are learned because of the ob- 
struction or blocking of a learned response. 

This brings us to the views of Mowrer (1953). He states that any stim- 
ulus that happens to occur with some increase in drive strength produces a 
capacity in the stimulus for eliciting a token (learned) drive. This token 
drive is emotional in character and akin to fear (in the case of hunger, pre- 
sumably it would be fear of hunger, if it were possible to pair an increase 
in hunger with some simple stimulus). A token-drive decrement is reinforc- 
ing, and thus whenever a response brings about reduction of fear, this re- 
sponse is increased in strength. 

This particular notion leads to the view that fear, or at least some kind of 
general avoidance, is characteristic of all secondary drives. Thus, it would 
seem that if the less obviously biological motivation of human life is to be 
explained as secondary drive, we are driven principally by fear or at least 
by something to be avoided. This, of course, is an unwarranted extrapola- 
tion, and to say the least, does not do justice to Mowrer’s views. It does, 
however, suggest the more temperate and perhaps important view that much 
of human motivation could be conceived of as learned fear or anxiety. 
Perhaps one trouble with this view is the implication, which is hard to es- 
cape, that fear or anxiety is necessarily unpleasant or to be avoided. Yet 
it is a commonplace that human beings will deliberately place themselves in 
fear- or anxiety-evoking situations (such as dangerous speeding or riding a 
roller coaster), and it seems gratuitous to assume that people do this simply 
because it feels so good when they stop. It is very possible that fear or anx- 
iety has attractive as well as negative qualities. 

These matters we have been discussing are important to the theory of 
human motivation, and no doubt the next few years will see intensive activ- 
ity in trying to fit them to problems in human behavior. At present, however, 
they stand as hypotheses derived from animal experiments, which have the 
potential of providing us with some reliable explanations of human behav- 
ior. We cannot force them to do more at present. 

MOTIVE AND INCENTIVE CONDITIONS IN HUMAN LEARNING 

Discussion of the nature of human motivation has probably covered 
more paper than any other psychological topic. We must treat this vast liter- 
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ature rather cursorily, for the simple reason that not very much of it is 
relevant to the question of the relationship between motivation and learning 
Some of It IS, however, and a few introductory words about theories of the 
origin of human motives are necessary in order to set the stage for the em- 
pineal and experimental studies we shall discuss in detail. 


The Origin of Human Motives 


A good deal of the theorizing about human motives has concerned the 
problem of their origins. An older view was that human motivation sprang 
from a number of instincts (the number varying with each theorist) com- 
mon to all human beings and exhibited by them in varying degrees. WhUe 
e multipl^e instinct view gives some signs of recovering from the extreme 
neglect U has suffered for the past thirty years, it is still not an influential 
notion. Today, the predominant view is that human motivation stems from 
either a small number of basic urges or even one basic urge, and that all of 
our concern for family, money, prestige, social status, security, etc., comes 
rom an elaboration of this basic urge or urges through experience and 
maturation. This seems to be the fundamental viewpoint of psychoanalytic 
theorists, and of those theorists who would see human motivation built up.- 

through a process of secondary reinforcement and of secondary-drive 
learning. 


Both the psychoanalytic viewpoint and the secondary-drive viewpoint 
seem to agree that human motives go through a developmental process. 

^ difference between these theories, of course, in emphasis, 
method and implications, but their major difference as far as we are con- 

ere is that the psychoanalytic view would put greater stress on 
maturation and the secondary-drive view, on learning. 

Both of these theories, however, seem to agree that goal relationships 
ave to e learned. In other words, most of the rewards and punishments i 
uman life are not innately rewarding or punishing, but develop these J 
characteristics through learning. Thus the child’s motives change and elabo- 
rate as it comes into various relationships with its parents and siblings. 

Most psychologists are convinced that the notions of human motivation 
given to us by psychoanalytic or secondary-drive theory are too simple, 
but unfortunately there is no other systematic theory that approaches these 
in completeness and inventiveness. Thus, while we agree that they are prob- 
ably partially right— human motivations do change through experience and 
maturation— we reserve the opinion that not all of the complex motives of 
human life can be demonstrably derived by either of these theories. 
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Experimental Study of Human Motivation 

Experimental approach. Motivation is an invented concept used to de- 
scribe certain important things about animal and human behavior; it is not 
a fact of the world or an experimental variable that can be directly con- 
trolled and manipulated. We associate this concept of motivation with a 
number of variables that we can manipulate directly, however, and these 
include such things as depriving animals of food for periods of time or 
changing their bodily balance by the introduction of drugs, etc. 

In the study of human motivation we are much more limited in the fac- 
tors associated with motivation that we can manipulate directly. There are a 
few experimental studies in which the influence of food deprivation upon 
some aspect of human behavior has been studied (see Lazarus, et al., 1953), 
but these studies are few and frequently not basic to our problems. In the 
older literature the most frequent technique in the study of human motiva- 
tion has been to take advantage of the covariation assumed between motive 
state and incentive value — the assumption that most people are “more mo- 
tivated” for $10 than $1. Thus, almost all of the experimental work on hu- 
man behavior cited in P. T. Young's well-known book. The Motivation of 
Behavior ( 1935), is on the effect of incentives upon human behavior. It is, 
however, desirable to use other techniques, if for no other reason than to 
study the interaction between goal or incentive value and states of the in- 
dividual induced by other conditions of his life. 

It is extremely difficult to produce conditions in the experimental labora- 
tory that allow us to vary systematically the factors associated with human 
motivation. For this reason, in recent years, there has been an increased 
emphasis in the study of human motivation upon w'hat has been called re- 
sponse-response relationships. This means that we use some behavior of 
the individual to tell us how he is motivated, and then use this behavior to 
predict something further about an individual — perhaps how he will per- 
form on a test of learning. Thus one of the main techniques in the study of 
human motivation has been the correlational approach. There are certain 
inherent limitations in this approach, the principal of which is the inability 
to eliminate extraneous variables. Thus, there is considerable argument 
over the merits of the response-response technique, or correlational ap- 
proach, to the study of human motivation. 

Studies of incentive value. There is a large literature in industrial psy- 
chology on the relationship between various incentives and output. Most 
of this is not particularly relevant to learning, but two examples will demon- 
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strate a pomt. Kitson (1922), in a classical study, showed that industrial 
workers of long experience may not necessarily be working at top efficieney. 

espite the fact that the typesetters whom he studied had ample opportunity 
to earn their trade to perfection, the addition of a monetary incentive re- 
sulted in a great improvement in output. Similar studies (Mace, 1935) have 

shown that the addition of an incentive may delay or eliminate work deere- 
merit due to continuous work. 


Various incentives used in industrial work create many problems of a 
psychological and sociological sort. While these are fundamental to an un- 
derstanding of human behavior in industrial society, they are not directly 
important to the psychology of learning. Consequently, the techniques of 
mcentive payment in industry will not be stressed here. Suffice it to say 
that the use of incentives such as piece-rate payments or special bonuses 
does not always work as we think it should. The motivations of workers in 
modem industrial society are very complicated. It is true that the human 
being m our civilization seems to be almost universally motivated for the 
acquisition of com, but this is frequently less important than other motives. 

e may e motivated to earn a certain, minimum amount of money with 

htt e effort. He may be motivated by such things as security, social approval, 

feelings of self-respect, etc. Thus, the manipulation of money as an incentive 
may not always lead to the expected results. 

. verbal rewards, and reproofs have been intensively 

s udied. These factors probably operate less upon learning itself than upon 
performance, but there is little reason to doubt that they are powerful tech- 
niques m the control of performance level. The effects of specific verbal 
rewards and punishments are sometimes difficult to evaluate, however, be- 
cause there are enormous individual differences in the effectiveness of these 

incentives (differences probably associated with individual differences in 
basic patterns of motivation). 

Motivation assessed by a self-report inventory. Let us turn to the tech-\ 
mque of assessing motivation by means of the behavior of the individual. 

I he most commonly employed technique here is the use of self-report in- 
ventories. These consist of a series of items such as “I am sometimes trou- 
bled over what others think of me” which the individual answers by true or 
false alternatives. In the experimental study of motivation the largest num- 
ber of mvestigators have made use of a self-report inventory known as the 
Manifest Anxiety Scale (Taylor, 1956, etc.). The theory behind this inven- 
tory IS that the items m it tap an emotional responsiveness that is related 
to drive or motivational level. Those individuals scoring high on the inven- 
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tory are said to be anxious, and their anxiety state is assumed to act as a 
generalized motivator to behavior. Because there is some emotional basis 
to this kind of anxiety drive, it has been questioned whether the Manifest 
Anxiety Scale primarily reflects properties ordinarily associated with emo- 
tion or with motivation (Deese, Lazarus, and Keenan, 1953; Eriksen, 1954). 
Consequently we shall deal with many of these same problems in the chap- 
ter on emotion and learning. For the present, however, let us see how 
far a consideration of manifest anxiety as an indicator of motivation 
gets us. 

At first blush the results of experimental studies are very favorable to 
the interpretation of this inventory as a measure of anxiety drive. There 
seems to be little doubt that classical conditioned responses (either the 
galvanic skin response, or a skeletal reflex such as the eyeblink) occur more 
readily in highly anxious individuals than in individuals reporting little anx- 
iety (Taylor, 1951; Spence and Taylor, 1951; Bitterman and Holtzman, 
1952; etc.). This is what one would expect if high drive had a generalized 
effect of increasing tendency to respond. 

Beyond these studies of classical conditioning the results become less 
clear, however. For example, in a study of the frequency of eyeblinks (not 
conditioning, but simply a study of frequency of occurrence), there was no 
tendency for individuals high on the anxiety inventory to blink more fre- 
quently, even when under conditions of tension (Meyer, Bahrick, and 
Fitts, 1953). Curiously enough, however, there was a tendency for fre- 
quency of blinks to correlate with another inventory of maladjustment. 

When we turn to the study of more complicated, instrumental acts, the 
picture becomes even more confused. Here, very frequently high anxiety is 
associated with poor performance (Taylor and Spence, 1952; Farber and 
Spence, 1953). High drive leading to poor performance would, at first 
glance, seem to be contrary to the usual effects of drive, but this is not 
necessarily so. For if anxiety is a truly generalized drive it will activate in- 
correct as well as correct responses. If, as is frequently the case in complex 
human learning, incorrect responses are initially stronger than correct re- 
sponses, it is argued that the effect of increasing drive would be to increase 
the difference between incorrect and correct responses. Therefore, one 
might reasonably expect, so the argument goes, that high anxiety woiffd be 
associated with poor performance in some situations, at least initially. Fur- 
thermore, one experiment (Montague, 1953) shows that if the task to be 
learned gives ample opportunity for interference, subjects scoring high od 
the anxiety inventory do more poorly; but if the task gives little opportunity 
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for interfering effects of incorrect on correct responses, subjects high in 
anxiety do better. 

Now this seems all very well, except that a direct test of the influence of 
drive state upon the interfering tendencies of incorrect responses does not 
seem to bear the analysis out. Buchwald and Yamaguchi (1955) compared 
the effects of habit reversal in the maze learning of rats under high and low 
thiist. The results clearly showed that reversal learning was more rapid 
under high drive, despite the fact that reversals in maze learning provide 
just about the optimal conditions for interference. 

The fact is that a hypothesis about the effect of drive upon interfering 
tendencies has never been clearly stated. Taylor and Spence (1952; Tay- 
lor, 1956) say that because drive multiplies habit strength to determine 
response strength, increasing the drive ought therefore to increase rela- 
tively strong habits at the expense of weak habits. Thus, if incorrect habits 
are strong they ought to be increased more than weak habits. This assump- 
tion, however, is not at all clear from the mathematics. To be sure, mul- 
tiplying two different numbers by a constant (equivalent, according to 
Taylor and Spence, to the effect of high drive on strong and weak habits) in- 
creases the absolute difference between them, but the ratio of the two habits 
remains the same. Some theorists have assumed that any choice an organ- 
ism makes between two responses is based upon the ratio of their strengths 
(or their probability of occurrence, if the two responses exhaust the possi- 
bilities), and in this case increasing drive level would not differentially in- 
crease the strength of the stronger habit. 

Thus, while the idea of using the Manifest Anxiety Scale as an index of 
motivation is an interesting one, the implications have been somewhat dis- 
appointing. In the next chapter we shall examine some of the implications 
of using this as well as other, similar scales as indices of emotional reactivity 
to stress. As we shall see, it is probable that our current concepts of emotion 
and motivation are closely allied to each other and that any really serious 
attempt to decide whether the Manifest Anxiety Scale reflects more of one 
than the other will await clarification of the concepts themselves. 

^Another kind of anxiety inventory has been used by other investigators 
(Mandler and Sarason, 1952; Sarason, Mandler, and Craighill, 1952). This 
inventory was designed to assess the individual’s reaction to situations in 
which his abilities are under test. Does, for example, an individual display 
signs of anxiety and worry when he is taking an important examination? 

The authors of this inventory believe that it taps both motivation for 
achievement and emotional responsiveness to conditions of stress. Thus in- 
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dividuals scoring high on this inventory are not only higher in motivation, 
but also, because of past learning, are likely to exhibit signs of emotional 
disturbance which might interfere with performance. 

Generally speaking, individuals who score on the high end of this “test 
anxiety inventory do more poorly in tests of learning and performance. 
This is particularly true in the case of tests known to be sensitive to emo- 
tional interference. In some cases, however, these individuals do better than 
those scoring low; they increase their scores more with practice. This is 
probably due both to the fact that these individuals adapt to their states of 
emotionality and to the fact that they are probably more highly motivated. 

Thus, the studies of individual dilTerences in the things tapped by these 
personality inventories suggest that in human life, there is a close relation- 
ship between many kinds of motivation and emotion. These are so inter- 
twined with one another that it is probably impossible simply to say that 
higher motivation automatically means better performance. This also de- 
pends upon the kind and level of emotional reactivity. 

\yProjective tests of motivation. Another important technique in the as- 
sessment of individual differences in human motivation lies in projective 
tests. The most ambitious of these has been an attempt to measure need 
for achievement by means of the Thematic Apperception Test (McClelland, 
et al., 1953). In this test, an individual’s need is evaluated by means of the 
stories he makes up to fit pictures of individuals in various situations. In 
general, great reliance is placed upon the extent to which an individual 
shows achievement imagery. The authors of this version of the Thematic 
Apperception Test were able to bring impressive evidence to show that 
they were actually measuring individual differences in motivation. 

What about the need achievement-learning and performance relation- 
ship? One experiment (Lowell, 1952) rather clearly showed that there was 
a difference in performance during the learning of an anagrams word test. 
Those individuals high in need achievement did much better. A lest on an- 
other task, in which subjects were not expected to show learning or im- 
provement, suggested that the difference in performance associated with 
need achievement is correlated with performance level and not with rate 
of learning. This is, of course, much what we should have predicted from 
studies of animal behavior. This relationship between need achievement and 
learning and performance may eventually be of practical importance, since 
there is evidence that need achievement scores are correlated with college 

grades, even when individual differences in intelligence are taken into con- 
sideration (McClelland, et al., 1953). 
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^hus, although the use of individual differences as a technique for in- 
vestigating motivation is relatively new, we have encouraging evidence that 
we shall learn something by it. The use of personality inventory scores has 
proved to be a little disappointing, largely because they are heavily saturated 
with emotional factors. There are many possibilities, however, for the de- 
velopment of inventories relatively free from emotional components (per- 
haps something like the traditional study habits inventory will work with 
college students), and this matter is not yet closed. The need-achievement 
scoring of the Thematic Apperception Test gives great promise, and re- 
cently a similar projective test has been used, with encouraging results, in 
the study of achievement motivation (French, 1955). We do not have much 
data on the relationship between these tests and the question of whether 
learning or performance is affected by motivation, but the evidence we do 
have suggests that it is largely performance rather than learning that is 
affected by different levels of motivation in human beings. Thus, any dif- 
ferences that do develop in learning are probably secondarily due to the 
basic differences in performance level. 

Incidental Learning 

We find another important technique in the study of motivation in hu- 
man learning in experiments on incidental learning. In these experiments 
the performance of individuals who practice at a task with deliberate intent 
to learn is compared with that of individuals who practice without this in- 
tent. Let us look at a simple example of an experiment in incidental learning. 

An experiment in incidental learning. One example of incidental learn- 
ing is provided in a study by Jenkins (1933). He had students, who 
thought they were serving as experimenters, read lists of nonsense words to 
other students, who were instructed to learn the words. After the learners 
had reached a certain criterion of performance, both the subjects and the 
experimenters were dismissed and told to return 24 hours later. At this 
time both subjects and experimenters were asked to recall the lists of words. 

It turned out that both the learners and the experimenters recalled many of 
the words; the learners, however, recalled more words than did the experi- 
menters. Upon asking for reports from the subjects, however, Jenkins found 
that some of the experimenters did deliberately try to learn, but even those 
who did not, recalled many words. 

This experiment shows that ( 1 ) even without specific instructions to 
learn, human beings will form a set to learn; (2) even without this specific 
set to learn, human beings will learn. In this connection, we may note, as 
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the set to leam decreased in Jenkins’s subjects, the amount of material re- 
tained 24 hours later also decreased. Thus, while it appears that subjects 

can learn without a specific set to learn, they seem to learn more adequately 
when they have such a set. 

Analyses of incidental learning. Incidental learning is a complicated 
effect, and there have been many experiments performed in an effort to 
untangle the basic causes of the differences between incidental and instructed 
subjects. Right now', of course, we are interested in the question principally 
from the point of view of motivation. Is the lesser motivational set induced 
in the incidental learners responsible for their poorer performance? 

In order to answer this question we must look more closely at the experi- 
ments. Saltzman (1953), for example, asked whether or not the difference 
between incidental and intentional learning might be due to the difference 
in the orienting task required of subjects working under two different con- 
ditions. Translating Saltzman’s question into the context of the experiment 
outlined above, we can see that even the task of being an “experimenter” 
might interfere with the incidental learning of words. Saltzman was able to 
show that if the orienting task were allowed to interfere with the perform- 
ance of intentional learners, they actually did worse than incidental learners. 

In subsequent experiments (Neimark and Saltzman, 1953; Saltzman, 
1956) the problem was further explored; it turns out that if intentional 
learning is superior to incidental learning, this is so only under certain con- 
ditions (such as certain distributions of trials through time) and with cer- 
tain orienting tasks. Some orienting tasks given to incidental learners inter- 
fere With learning and others facilitate learning as much as does intent to 
learn. 

Further analyses of problems in incidental learning have been made by 
Postman and his associates. Like Saltzman, they (Postman, Adams, and 
Phillips, 1955) assume that the difference between incidental and inten- 
tional learning depends upon the kinds of responses people make to the 
material presented to them. Specifically, they thought, intent to learn would 
make it more likely that human learners would notice critical differences 
between various verbal items they were required to learn. The experimental 
results showed that when items to be learned were easy to discriminate 
there was not much difference between incidental and intentional learning’ 
but in those cases in which the items to be learned were unfamiliar and low 
in association value, there was a large difference. 

Other experiments in this same series (Postman and Adams, 1956- Post- 
man, Adams, and Bohm, 1956) further emphasize the role of intent to leam 
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in determining the kind of responses made by the learners and whether or 
not these will be appropriate to the task of learning. Thus it is fairly clear 
that the comparison between incidental and intentional learning is not sim- 
ple, since the different effects these two conditions have upon performance 

wUl depend upon the nature of the task to be learned and the learner’s ori- 
entation to the task. 

Incidental learning and performance versus learning. What do the ex- 
periments on incidental learning tell us about the comparative effects of 
motivation upon learning and upon performance? It is clear from these 
experiments that what is learned depends upon the given set for learning. 
To the extent that a quantitative index of learning depends upon what is 
learned, the amount learned will depend upon the set for learning. Thus it 
is not surprising that an attempt to find an analogy in human learning to the 
latent learning found in maze learning of rats was not too successful (Post- 
man and Tuma, 1954). The troublesome thing about this experiment seems 
to be that it is difficult or impossible to devise conditions in which subjects 
perform the correct responses for learning the task. It is probably unfair to 
push the analogy to maze learning in rats too far, but it is much as if an 
unmotivated rat refused to run through the maze and thus could hardly 
learn the maze, latently or otherwise. 

Therefore, it is clear that unmotivated people sometimes do not learn 
very well, not because of some intrinsic connection between motivation and 
rate of learning but because they do not perform the responses actually to 
be learned. Some years ago there came to the attention of the author a 
youngster who had completely failed to learn the mechanics of subtraction 
in school. He was a bright boy, but evidently during the school hours de- 
voted to subtraction he had hit upon the expedient of writing down numbers 
more or less at random in order to satisfy the superficial requirements of 
the classroom. Thus, this unmotivated youngster failed to learn subtraction 
for the simple reason that he would not go through the activity necessary to 
the learning of subtraction. In a practical way, motivation may be impor- 
tant to learning simply because it is important to the performance of the 
right responses. 


CHAPTER 6 

EMOTION AND LEARNING 


It IS apparent to everyone that learning plays an important part in emo- 

tmnal and affective behavior. Most of our likes and dislikes are learned 

We all know that many, if not most, fears are acquired. We know about the 

importance of experience in the development of personality disturbances. 

In this chapter we shall not consider all of these things, but rather, look at 

some of the basic principles we must use when we apply the psychology of 
learning to problems in emotion. 

In applying the psychology of learning to problems in emotion, some new 
and important issues arise, and it is these issues to which we shall largely 
direct our attention. The problem of emotional learning does not involve 
merely the same principles we have discussed already; it involves new prin- 
ciples, as well as new ways of considering those we have already looked at. 
Before looking at the application of the psychology of learning to emotion 

however, one basic point about the concept of emotion needs to be men- 
tioned. 

Even the most superficial examination of the biological and psychological 
characteristics of emotion shows that it is related to the concept of motiva- 
tion. Motivation, as it is most often used by psychologists, emphasizes the 
activating role of central mechanisms in behavior. It has been apparent for 
a long time, however, that the factors associated with the activation of be 
havior are also associated with feelings of pleasantness and unpleasantness 
as well as with expressive behavior in rage or fear (Lindsley, 1951 ) Recent 
anatomical and behavioral work has shown the intimate interaction between 
the central neural mechanisms associated with motivation and those asso- 
ciated with emotion (see Stellar, 1954). 
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It is certainly apparent that some kinds of emotional activity accompany 
powerful motivation. We saw in the last chapter, for example, that a learned 
fear can motivate rats to learn a new instrumental act. The fear is clearly 
an emotional state, but it is also a motivational state. It turns out that a 
large portion of the experimental literature on the relationship between 
emotion and learning is devoted to an analysis of the motivating properties 
of such powerful, unpleasant emotions as fear. 

Before we consider the motivating aspects of emotion, however, let us 

see how emotions themselves are learned at the most elementary level. In 

order to do this, we shall examine some studies of emotional learning in 
animals and very young children. 


CONDITIONING AND EMOTIONS 

An important point about the psychology of emotions is that it has al- 
ways been associated with the activity of the autonomic nervous system— 
that part of the nervous system controUing the actions of the heart, glands, 
and smooth muscles. The activity of the autonomic nervous system is not 
voluntary, and the elements of emotional expression autonomically con- 
trolled are automatic and reflex-like in character. Thus, acceleration of the 
heartbeat, sweadng, blushing, etc., are all things that occur more or less 
automatically, given the presence of some emotion-producing condition. 

This involuntary characteristic of elements of emotional expression is 
important, because it suggests that the conditioning of these responses is 
classical rather than instrumental. In the first chapter, we saw that in classi- 
cal conditioning responses were elicited directly by stimuli. An uncondi- 
tioned stimulus elicits a response and this response can be transferred to a 
hitherto ineffective stimulus simply by pairing that stimulus with the uncon- 
ditioned stimulus. In instrumental conditioning, the response usually occurs 
spontaneously there is no stimulus that seems to elicit it — and this re- 
sponse is learned (occurs more frequently) when it is followed by some re- 
inforcement or reward. Thus, since emotional activity seems to be elicited 

by some more or less well-defined stimulus, it can be conditioned by the 
classical conditioning technique. 

Because of the reflex-like characteristics of emotional behavior, the first 
applications of Pavlov’s technique in American laboratories were to the 
conditioning of emotional behavior. Let us look at some of these early at- 
tempts to apply the classical conditioning technique to learning emotional 
behavior. 
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Classical Conditioning of Emotions 

I The Watson and Raynor study. One of the best known of all these 
studies is by Watson and Raynor (1920). These investigators were inter- 
ested in the development of fears in infants, and they made a long series of 
observations on a number of infants. One baby boy, who was about one 
year old at the time, was the subject of their conditioning study. Previously, 
they had discovered that a number of stimuli which one might ordinarily 
think would arouse fear were not fear producing for this youngster. White 
rats, dogs, masks, burning newspapers— none of these produced fear. One 
stimulus that did produce a startle reaction followed by crying and other 
signs of emotional activity was a loud sound. Consequently, they decided 

to employ the loud sound as an unconditioned stimulus for emotional 
activity. 

In the actual conditioning procedure they paired the sight of a white rat 

(previously not fear producing) with the occurrence of the sound. A very 

few pairings were sufficient to elicit a full-scale emotional response simply 

by presenting the white rat alone. Interestingly enough, this conditioned 

fear generalized to other animals and to other furry objects, such as a wad 
of cotton. ) 

This was a rather clear and dramatic demonstration of the possibilities 
of conditioned emotions, as well as a demonstration of the importance of 
such principles as stimulus generalization in the extension of the conditioned 
reaction. Watson and Raynor thought that this experiment could serve as a 
model for the way in which the complex and irrational fears of daily life 
might arise, and they placed much emphasis upon the importance of con- 
ditioned emotional reactions in the child’s personality development. 

In the intervening years, however, there have been enough studies of the 
developmental processes in young children so that we know Watson and 
Raynor’s conclusions were a bit overdrawn. We now know that the specific 
stimuli for fear and other emotional reactions depend to a considerable ex- 
tent upon maturation and the particular developmental stage of the child 

In part, the high specificity Watson and Raynor attributed to the emotional 
reactivity of the chUd was due to his age and maturation level. 

Other studies of emotional conditioning in children. One important 
point from the work of Watson and Raynor stands up well, however We 
know from subsequent investigations that emotional reactions can be con- 
ditioned by the classical technique. Jones (1931), for example, was able 
to show much the same pattern of conditioning in a fifteen-month infant by 
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cond noned response and noticed that overt reactions to the conditioned 
imulus extmgmshed much more rapidly than signs of autonomic activity 

class cdTonJf phenomena characteristic of 

assical conditioning as external inhibition and spontaneous recovery. 

«alval° Kn investigator conditioned the 

b^eT The ch^H ^ with a 

tion!r reLtir" r' conditioned emo- 

clearlv estahr"h h concluded that his studies rather 

tfcrrlouTt ‘I’J.^PPii^^abihty of the classical Pavlovian conditioning 

techmque to the modification of emotion in chUdren. 

vJeLZ^r‘T^ t conditioning. It turns out that these sim- 

the t^r ^«"<^‘ions are extremely important in 

which emot particularly as the theory applies to situations in 

studt. ?h ? occurred. The essential point of these 

Snc^ h emotional activity is conditioned by the classical technique. 

S nee the emotional reaction is aroused originally by an unconditioned stim- 

^d I. T behavior that is, by 

^d large, beyond voluntary control. Few of us can directly elicit emotional 

acuvity in ourselves in the same sense that we can open doors and recite the 

J’trnn^^'i “voluntarily” cry, etc., but there is a 

ong belief that this is an indirect kind of control. That is to say, it may be 

possible to produce voluntary erying by thinking about something likely to 

arouse emotional activity. The crying is indirectly aroused by the presence 

Of some internal (perhaps conditioned) stimulus. It is, incidentally, pos- 

si e to make use of “lie detection” techniques simply because emotional 

reactions usually beyond voluntary control can be aroused by external 
Stimuli. ^ 


Thus, the conditioning of emotional reactions seems to be largely a proc- 

• * ^1 Organisms learn to become emotional in new situations 

simp y ecause these situations occur when they are emotional for other 
reasons. At the beginning of this chapter, however, we pointed out that 
emotions seem to have some of the properties of motivating states such as 
unger or thirst. Where, then, does this aspect of emotional reactivity fit in? 
Clearly not in the transference of emotional reactions themselves from an 
unconditioned to a conditioned stimulus, since this merely involves the as- 
sociation of the response elicited by both these stimuli. If the emotional re- 
action IS Itself motivating, then reduction of it ought to serve as reinforce- 
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ment for instrumental activity. This gives rise to some very important ques- 
tions, which we can best examine by looking at the problem of avoidance 
conditioning. 

Avoidance Conditioning and the Two-factor Theory 

Since experiments in avoidance conditioning have been at the root of 
most notions that have tried to describe the role of the motivating proper- 
ties of emotions in learning, we shall do well to begin our discussion by 
looking at this kind of conditioning. 

An example of avoidance conditioning. Avoidance conditioning causes 
much difficulty because it looks very much like classical conditioning, al- 
though, as we shall see, the resemblance is misleading. A simple example 
will illustrate this point. 

A dog is trained to stand perfectly still in an experimental apparatus 
which permits the careful recording of the movements of all of its legs. One 
of its rear legs rests upon an electrode that completes a circuit through the 
dog, so that it can be shocked through this leg. If the dog lifts its paw, of 
course, the shock is terminated. In the conditioning procedure, a buzzer 
sounds about two seconds before a brief shock is applied to the dog’s leg. 
Soon after the conditioning procedure has begun, there are obvious signs 
of emotional disturbance elicited by the buzzer, and this makes it look very 
much like a simple example of classical conditioning. After a bit, however, 
the dog begins to lift its paw when the buzzer comes on. If the paw is kept 
up until the buzzer stops, the dog will not get shocked. After quite a few 
training trials, the dog lifts its paw to every sound of the buzzer, and thus 
never gets shocked. Therefore, it is quite clear that the paw-lifting response 
is instrumental; it prevents the dog from being shocked. 

This experiment starts out as an example of classical conditioning, but it 
clearly ends up with the characteristics of instrumental conditioning, for the 
response permits the dog to avoid the electric shock. 

This experiment could have been turned into an example of a “pure” 
classical conditioned response by strapping the electrodes to the dog’s leg 
so that it could not escape or avoid the shock by flexing its leg. Would this 
have made any difference? Certainly it would have influenced the frequency 
with which the leg response was part of a conditioned response. A rather 
well-known experiment with guinea pigs illustrates this point. In this experi- 
ment (Brogden, Lipman, and CuUer, 1938) a buzzer was sounded a num- 
ber of times, each time just before the guinea pigs received an electric 
shock. These animals were shocked in a revolving cage, and half of them 
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avoided the shock if they began to run when the buzzer was sounded. The 
remaining animals were shocked whether or not they ran. The results of 
this experiment can be seen in Figure 25. As this figure clearly shows, the 
animals that escaped or avoided the shock learned to run very quickly. 
Those conditioned exclusively by the classical technique, however, did not 
increase their frequency of running to the sound of the buzzer ve^ much. 
This does not mean that they did not give “emotional” conditioned re- 



Figure 25. A comparison of classical and instrumental conditioning of an operant. The solid line 
represents the frequency of conditioned responses to a buzzer when the animals (guinea pigs) are 
shocked for not running. The broken line represents frequency of conditioned responses when the 
buzzer is always followed by shock, whether or not the animals respond. (From Broaden, Lipman, 

and Culler, 1938.) y / P 


sponses on these trials, it means merely that the instrumental act of running, 
since it was not reinforced, did not dramatically increase in frequency. 
Probably the increase in running that does occur under the classical con- 
ditioning technique is the result of the high level of activity that is part of 
the emotional conditioned response.^ 

The paradox of avoidance conditioning. For a long time theorists wor- 
ried about an apparent paradox in avoidance conditioning. When an animal 

^ There is no implication in this experiment that classical conditioning is necessarily 

poorer than instrumental conditioning of the same response. See Kimble, Mann, and 
Dufort (1955). 
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escapes a noxious unconditioned stimulus such as shock by responding, it is 
rewarding. But, why should avoidance of the noxious stimulus be reward- 
ing? How can a stimulus not experienced by the animal be said to be a 
source of satisfaction or drive reduction? 

Common sense would say, of course, that the animal comes to anticipate 
the shock and that is why it responds to the buzzer. The learning theorist is 
not likely to be satisfied with this answer, however, since he will want to 
know how and why the anticipation develops. A number of theorists have 
attempted to answer this question, and one of the most widely accepted so- 
lutions is by Mowrer. Let us look at his notion about how avoidance con- 
ditioning develops. 

Mowrer's two-factor theory of avoidance conditioning. Since Mowrer 
(1947) argued that avoidance conditioning is a two-stage process, his 
theory is known as a two-factor theory. He says that first of all the animal 
learns to give a classical conditioned emotional response to the conditioned 
stimulus. This, of course, is a matter of pure involuntary association; it 
does not depend on the animal's voluntarily selecting a response because it 
is drive-reducing. As we saw in the last chapter, however, emotional dis- 
turbance or fear produced by a neutral stimulus is itself a drive. If the ani- 
mal performs an instrumental response that reduces the emotional disturb- 
ance, this is a reinforcing state of affairs and in subsequent situations the 
animal will again choose this response. Thus, if flexing the leg to escape 
shock also serves to reduce slightly the emotional responsiveness to the 
tone, it will be a response preferred by the animal. 

In a nutshell then, Mowrer’s theory says that the animal avoids the shock 
not just in order to avoid it, but to escape from the emotional state now con- 
ditioned to the buzzer. Actually, Mowrer was able to support his argument 
with some important evidence. What would happen, for example, if we 
altered the avoidance situation slightly? Suppose, instead of the buzzer al- 
ways being sounded until the shock came on, the buzzer was terminated if 
the animal anticipated the shock by giving an instrumental response. Mow- 
rer and Lamoreaux (1942) showed that, if this were the case, learning of 
the avoidance response would be more rapid. Thus, if it is possible for the 

animal to escape the conditioned stimulus, learning to avoid the uncondi- 
tioned stimulus is easier. 

Because the buzzer is contiguous with the shock, to summarize the mat- 
ter, it comes to elicit emotional reactions that are unpleasant to the animal. 
Some response terminates the buzzer, and the emotional disburbance is 
partially relieved. This relief constitutes need reduction, which is reinforcing 
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for the instrumental response. So really, one might say, the animal responds 
because the response terminates the unpleasant emotional activity con- 
ditioned t 9 the buzzer. Such behavior has the appearance of anticipatory 
behavior— and Mowrer believes that he has solved in part the problem of 
theory behavior— which has always been a stumbling block to learning 

We need not necessarily accept the distinction between contiguity con- 
ditioning of emotional (primarily autonomic) responses and need-reduc- 
tion conditiomng of instrumental responses, as does Mowrer. The need- 
reducing properties of the reinforcement for the instrumental response can 
be thought of simply as the instigator. Mowrer’s theory does necessarily 
imply, however, that the conditioned emotional response must occur before 

the animal can learn to avoid the unconditioned stimulus by making an in- 
strumental response. 

More about the two-factor theory. The two-factor theory has had enor- 
rnous implications for the theory of behavior. It has suggested, to some bold 
theorists, that many of the mechanisms of neurotic behavior are learned in- 
strumental responses used by the individual to reduce in part some learned 

fear or anxiety. Thus it is important that we examine this notion in some 
detail. 

Solomon and Wynne (1954) have extended the two-factor theory to 
take into account some experimental data they believe to be important. 
One of these is the problem of extinction of the emotional reaction. If an 
avoidance habit is well established, the animal no longer receives painful 
stimulation, therefore the emotional reaction to the conditioned stimulus 
ought to extinguish and thus indirecUy weaken the instrumental response. 
Solomon points out, however, that things do not always happen this way. 
Sometimes avoidance responses are extraordinarily resistant to the effects 
of extinction, and furthermore latencies of the conditioned avoidance re- 
sponse keep getting smaller even after the animal has received its last shock 
(Solomon, Kamin, and Wynne, 1953). One explanation for this, suggest 
Solomon and W 5 'nne, lies in the fact that the animals perform the instru- 
mental response extremely quickly when the conditioned stimulus appears; 
so quickly, as a matter of fact, that there is no time for the conditioned 
stimulus to elicit the full-blown emotional response. Thus, they argue, the 
emotional response is conserved by extremely rapid instrumental respond- 
ing. Since this responding does not allow anxiety to develop, it is not fully 
reinforced, and the animal may lose interest in responding rapidly. Every 
so often, therefore, the reaction time for the instrumental response is rela- 
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Lively long. It is protracted enough to elicit a full-blown emotional response, 
and when this happens the animal does perform the instrumental response 
and is reinforced. In addition, Solomon and Wynne argue that intense emo- 
tional conditioned responses are irreversible (not subject to extinction ) . This 
is an interesting suggestion, and we shall return to it later. 

Some of the further experimental implications of the two-factor theory 
are being intensively explored. Additional work by Mowrer and his associ- 
ates (Mowrer and Solomon, 1954; Mowrer and Aiken, 1954) has estab- 
lished the importance of temporal relationships between conditioned and 
unconditioned stimuli in the establishment of fear, and the freedom of such 
a response from any drive-reducing mechanism. Some difficulties with the 
two-factor theory do crop up. however. For example, Dykman, Gantt, and 
Whitehorn (1956) find that a component of the conditioned emotional 
response, conditioned heart rate changes, varies rather nicely with the in- 
tensity of the painful unconditioned stimulus, but the differentiation of this 
emotional response does not precede the differentiation of an instrumental 
avoidance response to the conditioned stimulus. Thus, the differences of 
instrumental responding to different intensities of painful stimulation do 
not seem to be dependent upon prior differences in conditioned emotional 
states to these intensities as perhaps they should be according to the theory. 
We may expect to see a good deal of experimental work on this and related 
problems in the two-factor theory in the next few years. 

THE ROLE OF EMOTION IN CONFLICT 

There is a widely held notion that conflict between the anxiety-producing 
aspects of certain goals and their positive attractiveness is responsible for 
certain disturbances in behavior. Stated more generally, conflict is regarded 
as important in behavioral disorders. Consequently there have been rather 
intensive laboratory studies of the nature of conflict in an effort to establish 
a relationship between the theory of conflict and more general theories of 
learning. In this section our aim will be to see how this experimental work 
has managed to determine a relationship. 

The Basic Principles of Conflict 

Lewin (1931, 1935) was responsible for the basic analysis of conflict 
behavior. His analysis, of course, was within the framework of general no- 
tions about fields of psychological forces responsible for the psychological 
stale of the organism. Lewin has made a distinction between three types 
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of conflict. These are (1) approach-avoidance conflict, (2) approach- 
approach conaic., and ,3, avoidanccavoidancc conflic,. AppLchTvSi 
an^ conflict occurs when the same stimulus object is both desirable and 

faced wit^ i conflict, the organism is 

nal y, m avoidance-avoidance conflict, the organism is faced with stimuli 
tha are only to be avoided-I can go to the dentist and suffer in the chair, 
or I can avoid the dentist and endure a toothache. 

bv examined the implications of these definitions 

by rnakmg certain hypotheses about the relative strengths of approach and 

avoidance gradients. Miller points out that there are four principles funda- 
mental to the analysis of conflict behavior. These are: 


L The tendency to approach a goal is stronger the nearer the organism is 
to the goal. ® 

2. The tendency to avoid an unpleasant stimulus is stronger the nearer 
the subject is to the stimulus. 

3. The tendency to avoid increases more rapidly with nearness to the un- 

p easant stimulus than does the tendency to approach with nearness to a 
goal. 


4. -raese gradients of approach and avoidance are dependent upon the 
strength of the particular drive. With stronger drives, the tendencies to ap- 
proach goals or avoid unpleasant stimuli are much stronger. 

Brown (1948) was able to show that these principles do apply to the be- 
avior o rats. He trained a group of rats to run down a straight alley to food 
associated with the presence of a light. The animals wore a light harness 
which could be attached to a calibrated spring, so that if the animals were 
momentarily restrained, the strength of pull could be measured. In the ex- 
periments, Brown measured the strength of pull at two different points in 
tea ey, at one point near the goal and at one point far from the goal. 
For a study of the avoidance gradient, the animals were shocked in the 
goal box and then the strength of the tendency to avoid the goal box was 
measured at two different points in the alley. 

The results of Brown’s studies can be seen in Figure 26. The avoidance 
gradient is steeper than the approach gradient, and increasing the drive, either 

by making the animals hungrier or by increasing the shock, raises the level 
of both the approach and avoidance gradients 
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Approach-avoidance conflict. If the same goal were made both attrac- 
tive and unpleasant, one would expect from the results of Brown’s experi- 
ments that the animals would approach part way and then stop and vacillate 
because the avoidance and approach gradients had crossed. Where this 
happened they would be about equal in strength, and thus the animals 
would be subject to an almost equal pull in opposite directions. This is 



DISTANCE (CMs) FROM REINFORCEMENT 


■ 0 ure 26. The solid li>K is on approach gradient. The dashed lines are avoidance gradients of 
■tterent strengths. Conflict is strongest where the avoidance gradients cross the approach gradients 
o e that the strong avoidance gradient crosses the approach grodient further from the goal than 
oes the weak ovoidanco gradient, (from Miller, 1944; data from J. S. Brown.) 


actually what happened in an experiment (Miller, Brown, and Lipofsky, 

1943). Animals ran to a goal box in order to get fed, and they were shocked 

while they ate. Thus, both an approach gradient and an avoidance gradient 

were produced. The results of the experiment showed that if the shock were 

very strong, the animals would run only a little way down the alley and then 

hesitate; if the shock were weak, they would run almost the fuU length of 

the alley before they stopped. Thus, the principles tested by Brown are 
verified in this experiment. 

A voidance-avoidance conflict. The avoidance-avoidance conflict situa- 
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tion is one of the most interesting of the cases. In this kind of conflict vacilla- 
tion is maintained the longest. Avoidance-avoidance conflict makes the organ- 
ism choose between two undesirable alternatives. If it moves away from one 
of the alternatives toward the second, the avoidance tendency to the second 
becomes greater and the organism will tend to move back toward the first 
again. This kind of situation Miller calls a stable equilibrium. Two experi- 
ments (Klebenoff, 1939; Hunt, 1943) demonstrate clearly the tendency 
toward vacillation as well as a tendency to try to leave the field. 

Approach-approach conflict. Miller points out that approach-approach 
conflict will produce little vacillation — it is essentially an unstable equi- 
librium. Once the conflict is resolved (perhaps by a purely chance choice), 
the organism tends to approach one of the goals with increasing strength, 
since, of course, as it gets closer to that goal the attractive pull of that goal 

becomes greater and the attractive pull of the alternative goal becomes 
weaker. 

Not all experimental results agree in showing that approach-approach 
conflict works this way, however. Godbeer (1940) reports considerable 
vacillation in approach-approach conflict in children. Williams (1943) says 
that this is because the child can anticipate symbolically the frustration of 
losing one of the goals, something a rat cannot do. Williams points out that 
many supposed cases of pure approach-approach conflict also involve an 
element of avoidance. It is not to be expected that approach-approach 
situations will produce no vacillation. 

Miller recognizes this, and he suggests that approach-approach situations 
sometimes involve another kind of conflict, a conflict he names “double 
approach-avoidance.” This occurs because as one goal is approached the 
other becomes more distant and the organism wishes to avoid giving up 
this goal. The net result is that the approach tendency may become rela- 
tively weak as the organism approaches the goal. The distant goal seems 
more attractive, and the organism may turn back to it. 

Further Applications of the Principles of Conflict 

The problem of displacement. One application of the principles of con- 
flict has been to displacement. Displacement occurs when an act of aggres- 
sion cannot safely be directed toward its instigator, but instead gets diverted 
to some innocent object. Thus the man who is angry at his boss inhibits 
expressing aggression to the boss and instead, expresses it to his secretary. 
Displacement would occur only if we assume that the response of inhibiting 
the aggression (avoidance) shows a much steeper curve of stimulus general- 
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ization than the displaced (generalized) aggression response itself. Miller 
and his associates (Miller and Kraeling, 1952; Murray and Miller, 1952) 
were able to establish that this is the case for a simple maze habit in rats. 
Thus, there is considerable justification for extending the analysis of conflict 
to the problem of displaced reactions. 

Conflict and anxiety. In conflict that involves avoidance tendencies, 
there is nearly always some level of conditioned emotional behavior present. 
This is because, as we saw earlier, the avoidance tendency itself depends 
upon the conditioned emotional reaction. Usually avoidance means the emo- 
tion of fear or of anxiety, so in any conflict involving avoidance one of 
these is present. In addition there has been a widespread belief that the con- 
flict per se is the source of additional fear or anxiety. We shall examine this 
notion when we come to look at the problem of experimental neuroses. It 

serves to point out, however, the great importance of the analysis of the 
conditions of conflict. 

The spatial analogy in the analysis of conflict. Most of the experiments 
on conflict we have discussed are experiments in which actual spatial dis- 
tance from the goal was an experimental variable. The space, in these cases, 
is physical. However, from Lewin has come the notion that conflict can be 
conceived of in the framework of forces within a psychological space. This 
notion of psychological space is implicit in much of our thinking about con- 
flict. For example, when we speak of avoiding the dentist versus avoiding 
the toothaches as an example of conflict, we implicitly conceptualize this 
situation in an imaginary psychological space. We see the individual as a 
point in this space surrounded by barriers and buffeted by two opposing 
forces. The analogy to experiments on rats in physical space would have 
much greater force if we could deal with this imaginary psychological space 
tn the same way that we deal with physical space. This is one of the great 
unsolved problems in the analysis of conflict, and the fruitfulness of this 
approach to more general cases of psychological conflict will depend upon 
the degree of success we have in solving this problem. 

THE PROBLEM OF PUNISHMENT 

Behavior is punished when some painful stimulation or the threat of such 
stimulation is made contingent upon it. Thus, if I punish my dog by cuffing 

It for chewing up the newspapers, in effect I have made the cuffing con- 
tingent upon chewing newspapers. Usually punishment is administered to 
moderately well-motivated behavior in order to stop the behavior. Thus 
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punishment is a technique for one organism controlling the behavior of 
another, though, of course, some particular behavior of an animal or human 
being may be inadvertently punished by a physical accident. In this case a 
pseudocorrelation between the behavior and its punishing consequences is 
set into the organism. 

The essential property of punishment, then, is that it is painful or un- 


pleasant stimulation applied to moderately well-motivated or highly moti- 
vated behavior. The rationale behind the social use of punishment is that it 
will stop undesirable behavior, though our modern insights into personality 
mechanisms lead us to suspect that punishment is sometimes administered 
to satisfy some need of those who inflict it. As a problem in the theory of 
learning, however, punishment is best seen as the influence of painful or un- 
pleasant emotion-producing stimulation upon behavior. 


The Classical Experiments on Punishment 

For many years experimental and theoretical work on punishment was 
dominated by the views of the late E. L. Thorndike. He was, for most of 
his professional life, an educational psychologist, and consequentiy his con- 
cern with the problem of punishment was influenced by the formal demands 
of the classroom. In a word, he was concerned with the influence of verbal 
praise and reproof upon behavior. Since his views and experiments were 
influential, let us look briefly at them. 

Thorndike's views and experiments. In his earliest writings, Thorndike 
simply adopted the common-sense view that punishment would reduce the 
tendency to repeat behavior it immediately followed (weakened connec- 
tions, as Thorndike put it). Later, however, Thorndike (1932a) examined 
the problem more thoroughly and came to another conclusion. He decided 
that punishment really had no inherently weakening effect on behavior. 
Let us see how Thorndike arrived at this rather surprising conclusion. 

Thorndike performed a number of experiments on punishment, with 
both animal and human subjects. As his experiments on human subjects are 
better known, let us review one of these. Thorndike (1932b) asked subjects 
who were not familiar with Spanish to guess the correct English equivalent to 
a Spanish word. He gave a series of such words in the form of a multiple- 
choice test. If the subject chose the correct word, the experimenter in- 
formed the subject by saying “right”; if the word was incorrect, the ex- 
perimenter said “wrong.” It was possible to find the influence of “right” 
or “wrong” by finding out, on further testing, if the subjects gave the pun- 
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ished or rewarded responses with a frequency greater or lesser than that 
expected by chance. Since there were five alternatives for each item, the 
frequency of repetition by chance would be 20 per cent. This was the base 
line against which Thorndike measured the effect of saying “right" or 
‘wrong.” As it turned out, “right” did increase the tendency to repeat the 
initially given response. The consequence “wrong”, how'ever, rather than 
being w'eakening, seemed to Thorndike to have a slight strengthening effect. 

Criticisms of Thorndike. Many similar experiments by Thorndike 
(1935) led him to the same conclusion. Other investigators leveled serious 
criticisms at him, however. First of all, they argued, the use of “chance” as 
a base line against which to measure the effects of reward or punishment 
is not good; it is extremely unlikely that responses given by subjects in such 
tests are at the chance level; rather responses go in “runs,” or sequences, 
to which the one-in-five chance expectancy would not apply. Stephens 
(1934), for example, used an empirically determined probability of repeti- 
tion by including items to which the experimenter gave no reward or pun- 
ishment. The frequency of repetition without any consequences turned out 
to be 36 per cent instead of the 20 per cent assumed by Thorndike. Meas- 
ured from this base, punishment did have a slight weakening effect (given 
the mildness of the punishment and the low motivation for repetition we 
should not expect a large effect). 

Punishment indirectly weakens. Thorndike argued, however, that 
wherever punishment appeared to weaken a response it was an indirect ef- 
fect. This happens because punishment produces variability of behavior. 
In maze learning, for example, punishment of an error may lead the learner 
accidentally to make the correct response, and this response indirectly elimi- 
nates the punished response, because the correct response is strengthened 
due to its effect. It has never been precisely clear what Thorndike meant 
by the indirect weakening of punished responses (see the controversy, 
Postman, 1947; Stone, 1948) but it would be something like this: Punish- 
ment may or may not weaken a response, but it clearly cannot be the mirror 
image of the action of a reward. If a response is rewarded, it is apparent to 
the organism in a typical experimental test that repetition of this response 
will again be rewarding, but if a response is punished, it is not clear to the 
organism which of the other available responses will be rewarded. Thus, 
behavior after punishment will be more variable, e.g., the prediction of the 
exact response to be made by the organism will be more difficult. This is 
the most general statement we can make. In the Thorndikian situations the 
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punishment is a mUd one, and perhaps its main function is to provide in- 
formation. If this is so, reproof for making a particular response does not 

supply as much information as praise, for it does not tell the subject which 
response is correct. 


Punishment in Experiments on Animal Behavior 

Many of the important conditions of punishment are ignored in the 
Thornd.k.an experiments. The magnitude and quality of unpleasant effects 
are ignored, and, perhaps more importantly, the effects of restricting the 
available responses are ignored. This last point needs some amplification 

In Thorndike’s experiments the learner may satisfy his motivations by 
making responses other than the ones punished. He is seldom interested in 
defending his original response as the “right” one, and he will readily aban- 
don It when he knows it is “wrong.” Thus, the effect of punishment is to 
produce variability of behavior, as Thorndike suggested. 

What happens, however, when the only response which satisfies a drive 
IS punished? The child who is punished for taking cookies from the cup- 
board knows no other way to satisfy a craving for sweets. No matter how 
many raps on the knuckles the child may receive, the cookies lose none of 
their attraction, and he is not likely to find another way of getting cookies. 

There are interesting experiments upon the behavior of rats that bear di- 
rectly on this problem. 

Skinner’s experiment. Skinner ( 1938 ) studied the effect of punishment 
upon the lever-pressing response in the Skinner box. In his experiment, the 
punishment was a sharp slap on the paws from the lever itself. To measure 
the effects of punishment, Skinner obtained extinction curves following pe- 
riodic reinforcement with food from four rats. Of these rats, two were 
slapped at the beginning of the extinction period and two were not. The 
result was that the rats slapped at the outset of extinction showed a much 
lower rate of responding during extinction. As a matter of fact, the punish- 
ment seemed to inhibit the response entirely for a period of time. The 
curious thing, however, is that by the time extinction had been completed, 
the slapped rats had caught up, in total number of responses emitted, with 
the rats not slapped. These results can be clearly seen in Figure 27. 

Skinner interpreted the results of this experiment to mean that punish- 
ment has only a temporary effect upon behavior. It does not affect the “re- 
serve of responses which the animal will eventually emit; it affects only the 
rate at which this reserve will be exhausted. Early in extinction the punished 
animals show a lower rate of responding, but eventually the rate for these 
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animals accelerates so that they show a total number of responses equal to 
that of the unpunished animals. 

Estes' experiments. Estes (1944) followed Skinner’s experiment with 
an intensive investigation of the factors affecting punishment of an isolated 
instrumental response — lever pressing in the rat. In these experiments he 
substituted an electric shock for the slap. The shock was delivered througlt 
the lever, so that rats used to receiving food as the result of lever pressing, 
sometimes received shock instead. 

Estes’ first conclusion is much like Skinner’s. The effect of punishment 
is primarily upon the rate of responding rather than on the over-all tendency 
to emit a particular response. Estes did find, however, that if the punish- 
ment were intense enough or lasted long enough, there was a slight 
permanent depression in the reserve of available responses. Nevertheless 
punishment was never able to eliminate the extinction curve completely — 



TIME IN MINUTES 

Figure 27. The efFects of punishment upon resistance to extinction. The initial effect of punish- 
ment is to reduce the rote of responding. There is, however, recovery, and eventually the total num- 
ber of responses Is about the some for the punished ond control animals. (SJcinner, J938.) 

rats always came back to respond after punishment, even though they were 
never again reinforced. Interestingly enough, when Estes punished his rats 
periodically instead of for every response, the depression of the rate of re- 
sponding was not nearly so severe, though it lasted longer. Thus, the results 
of periodic, or partial, punishment are somewhat analogous to the results of 
partial reinforcement. 

Punishment and conditioned emotional responses. Estes interprets the 
effects of punishment somewhat as follows: anxiety or fear aroused by the 
shock becomes conditioned to the stimuli in the experimental box. This 
anxiety, most strongly conditioned to the visual stimulus of the lever, causes 
the animal to withdraw. Withdrawal reduces the anxiety and is therefore 
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reinforcing. In Estes’ terms, anxiety becomes prepotent over the lever- 

pressing response, which consequently disappears for a time. However, 

since the animal does not press the lever, the anxiety conditioned to the 

lever extinguishes and the animal gradually returns to pressing the bar. Un- 

kss the response is again punished, the anxiety gradually diminishes, aUow- 

ing the animal to exhaust the reserve of lever-pressing responses it built up 
during food reinforcement. 

Thus, in this experiment, the effectiveness of punishment seems to come 
from an emotional reaction conditioned to the environment in which the 
rewarded instrumental response is likely to occur. The emotional reaction 
prevents, or at least reduces, the tendency for the emission of the instru- 
mental response. When the emotional reaction extinguishes, as it should 

if the animal is not punished again, the tendency to perform the instrumental 
response returns. 

Punishment, then, seems to be a rather special case of avoidance learn- 
ing. This is pointed out by Dinsmoor (1954) and he shows that a fruitful 
analysis of the possible effects of punishment can be obtained by studying 
experiments in avoidance learning. He brings out that the operation of pun- 
ishment is essentially a two-step process as it is in avoidance learning.^ 

The use of the suppression period produced by punishment. It is clear 
that punishment may not be too effective in completely eliminating some 
behavior. Punishment may momentarily weaken a response, but it does 
nothing to eliminate the behavior permanently. This conclusion seems to 
be in accord with the clinical and anthropological evidence about aggres- 
sive behavior punished by parents or by some agent of society. The tend- 
ency toward aggression is not removed until it can be brought to free 
expression, and thus special psychotherapeutic techniques are frequently de- 
vised for the express purpose of allowing aggression to occur in a displaced 
(therapeutic) situation. 

It is possible, however, to use the temporary period of response inhibi- 
tion produced by punishment to teach an organism a new method of reach- 
ing the same goal. An experiment by Whiting and Mowrer (1943) suggests 
that this may be a valuable technique in the control of behavior. 

These investigators studied ways of getting rats to abandon old, preferred 

2 Hunt and Brady (1955) have compared the effectiveness of punishment and a 
conditioned emotional response in suppressing instrumental behavior. Their results 
show that the conditioned emotional response is more effective, because of the more 
adequate stimulus control it exerts over behavior. 
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adjustments in favor of new ones. They used three groups of rats. All three 
were taught to run a simple maze. After the habit of running to the goal 
box had been acquired, they extinguished it in one group of rats by omitting 
the reinforcement. They continued to reinforce the habit for the second 
group, but placed a barrier in the path leading to food. In the case of the 
third group, they punished the running response by an electric shock. The 
situation was then changed so that all three groups could find the way to 
the goal by an alternative route. The animals that had been punished on the 
original habit showed the fewest errors in reaching the goal by the alterna- 
tive pathway. They also showed less regression to the original habit after 
the alternative habit had been learned. 

Thus, teaching an alternative response during the period of suppression 
induced by punishment could be an effective means of redirecting behavior. 
Of course, as we might expect, if the second habit is extinguished, there will 
be a tendency to revert to the original, punished habit. 

The effects of traumatic punishment. When we were looking at the prob- 
lem of avoidance learning, we saw that Solomon and Wynne ( 1954) sug- 
gested that sometimes the conditioned emotional reaction set up by painful 
stimulation was extremely resistant to extinction — indeed perhaps never 
could be extinguished. If the same thing applied to punishment, it is pos- 
sible that a punished response would never recover its strength. The experi- 
ments on avoidance learning that Solomon and Wynne were talking about 
were experiments in which very traumatic levels of electric shock had been 
used. In the experiments on punishment we have looked at thus far, the 
punishment was relatively mild. Consequently, it is possible that under a 
really traumatic punishment, the instrumental response punished may never 
recover its original strength. 

This seems to be the case in some studies of Masserman (1943) on pun- 
ishment in cats. As we shall see shortly, the experiments of Masserman 
suggest that the effects of such punishment may go far beyond the instru- 
mental response punished; there may be changes in almost all behavioral 
adjustments of the animals. This is the case in the experimental production 
of neurosis, the problem we shall consider next. 

EXPERIMENTAL NEUROSIS 

One of the interesting problems in the relationship between emotion 
and learning is provided by those investigators who have attempted to pro- 
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duce disturbances in behavior by experimental conditions in the laboratory. 

In this section we shall look at some of the experiments and their resultant 
theories. 

The Experimental Production of Neurosis in Animals 

Pavlov's experiments. Some observations by Pavlov (1927; 1928) pro- 
vide the introduction to the technique of producing “experimental neurosis.” 
While studying the conditioned salivary response, workers in Pavlov’s lab- 
oratory noticed that animals sometimes developed long-standing disturb- 
ances of behavior. Pavlov began to study these disturbances seriously after 
an experiment in which he attempted to condition salivation by using elec- 
tric shock as a conditioned stimulus. The electric shock seemed to work 
fairly well for a while, but after an attempt at generalization, the conditioned 

salivation was replaced by a violent struggle quite foreign to the usual be- 
havior of the dogs. 

A subsequent experiment led Pavlov to an intensive study of disturbances 
of behavior. This experiment started out as an attempt to test the limits of 
form discrimination in a dog. The animal was trained to salivate to the 
presentation of a luminous circle on a screen. Then an attempt was made 
to get the animal to discriminate between a circle which was reinforced and 
an ellipse which was not. After discrimination was established, the ellipse 
was gradually changed to look more like the circle. For a while, the dog 
continued to discriminate, but then as the two stimuli became more alike, 
discrimination suddenly became very poor. In addition, the whole character 
of the animal s behavior changed. The dog, which had been trained to stand 
quietly in the apparatus, began to struggle and squeal. Eventually it became 
quite violent and resisted being taken to the experimental room. This be- 
havior is what Pavlov called an experimentally produced neurosis. 

Pavlov believed that the primary cause of this disturbance of behavior 

in the test situation was a conflict in the cerebral cortex between opposed 

forces of excitation and inhibition. The excitation was produced by the 

positive conditioning of one stimulus and the inhibition by the extinction 

of the other. Both conditioning and extinction occur in discrimination, and 

when the stimuli get close to each other, the conflict between the opposing 

forces of excitation and inhibition becomes strong enough to disturb the 
whole cortex. 

Pavlov s general notions about cerebral physiology have more or less 
fallen into disrepute as we have come to know more about the facts of 
cerebral activity. His interpretation of neurosis in terms of conflict, how- 
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ever, is still influential. If we were to rephrase Pavlov’s theory in language 
less tainted with outdated physiology we might say that experimental neu- 
rosis occurs because of the inability of the animal to tell when to respond 
and when not to. This conflict could be the result of a discrimination that 
is too difficult for the animal. It should be noted in passing, however, that 
such a conflict can arise from causes other than difficult discrimination. If 
the animal must delay a conditioned response too long after the presenta- 
tion of the conditioned stimulus (because of a delay in presentation of the 
unconditioned stimulus) there will be a conflict between the tendency to 
respond and the tendency to inhibit the response until the appropriate time. 

Other investigations of experimental neurosis. Since Pavlov's original 
observations there have been many other attempts to produce disturbances 
in the behavior of animals in the laboratory. Gantt (1936) has closely fol- 
lowed Pavlov’s work, and he has been able to confirm Pavlov’s observa- 
tions on disturbances of behavior during difficult discrimination. 

In studies by Liddell (1944) the conditioned flexion response has been 
used rather than the salivary response. In experiments on sheep, dogs, and 
pigs, Liddell and his associates have been able to produce many behavioral 
disturbances in the laboratory as well as disturbances in bodily functioning. 
In these experiments there has been great emphasis placed upon the role of 
restraint of the experimental animals during testing. Restraint, monotony 
of stimuli, and other factors are responsible for the disturbances, according 
to Liddell. It is interesting to note that Liddell finds that many of these 
disturbances persist when the animals are removed from the experimental 
situation. 

Some of the most interesting observations of experimentally produced 
disturbances of behavior come from Masserman (1943), Masserman used 
an instrumental response in producing his disturbances. Cats were trained 
to open a small food box after a stimulus was presented. The techniques 
used to produce the disturbances of behavior were frustration, conflict, and 
punishment. Suffice it to say that by these techniques Masserman was able 
to produce markedly deviant behavior in his cats. Some cats became 
“phobic” toward the box, particularly after punishment. Other cats went 
into semicataleptic states, refusing to move or even to eat for long periods 
of time. Masserman attempted a variety of therapeutic devices to relieve the 
neurosis after it had been established. Rest was about the most effective 
means of allaying the animals’ symptoms. Also effective were reducing the 
hunger drive through forced feeding outside of the experimental room, “re- 
assurance,” and social imitation of nonneurotic cats. 
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Theories of Experimental Neurosis 

Today there is less emphasis upon trying to find analogies between dis- 
turbed behavior in animals and that in human beings than there was a few 
years ago. Nearly all psychologists realize that a technique used to produce a 
disturbance in an animal need not achieve the same effect in human beings. 
There is less concentration upon techniques which produce disturbed be- 
havior in animals than upon the production of conflict between two habits 
or upon the production of an inhibiting fear in animals. Let us look at some 
of the notions about disturbed behavior in animals, not with the idea of find- 
ing analogies to similar behavior in human beings, but with the idea that 

we might be seeing simple models of some of the processes at work in hu- 
man neuroses. 

Masserman's theory. Of all of the recent theories of behavioral dis- 
turbances based upon experimental studies of animals, Masserman’s comes 
closest to considering disturbances in animals to be analogous to those in 
human beings. He emphasizes the fact that traumatic situations in the ex- 
perimental tests are responsible for phobic reactions and fixations, much 
as they are supposed to be in the case of human personalities. The symp- 
toms of withdrawal and confusion exhibited by his animals, Masserman be- 
lieves, occur because of the conflict between the motive to get food and the 
anxiety associated with the punishment of the food-getting responses. 

While Masserman does not explicitly point up the relationship, his anal- 
ysis comes very close to the kind we should expect from our earlier dis- 
cussion of basic learning mechanisms in emotional behavior. In addition, 
however, he points out the many parallel points between disturbed personal 
and social adjustments in human beings and what he believes to be their 
counterpart in animal behavior. Though Masserman’s views of the mecha- 
nisms in disturbed behavior do not provide us with a systematic theory, 
they are important in that they represent the first real attempt to bring to- 
gether psychiatric and psychoanalytic theories with the experimental study 
of animal behavior. 

The Dollard and Miller theory. A systematic account of the relation- 
ship between problems of emotional learning in animals and behavior dis- 
orders in human beings is presented by Dollard and Miller in Personality 
and Psychotherapy (1950). These writers are not specifically interested in 
presenting a theory of experimental neurosis, but they are interested in ap- 
plying the basic principles of learning to the description of human per- 
sonality disorders. Though their aim is somewhat different, the general re- 
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suit of bringing together experimental observations of animal behavior and 
observations of human beines in trouble is the same. 

Perhaps the most important principle that Dollard and Miller use is 
approach-avoidance conflict. People frequently find themselves in the un- 
fortunate position of having conditioned fear responses aroused by situa- 
tions which can reduce their positive drive. In other words, there is conflict 
between fear and desires. Dollard and Miller discuss in detail the case his- 
tory of an individual who suffered from conflict between the fears instilled 
by a repressed childhood and iier normal, adult sexual desires. Since the fear 
actually prevented this woman from satisfactorily satisfying her positive 
needs, misery and anxiety arose. Furthermore the fear produced repression 
of fear-arousing thoughts and images so that the fear itself was prevented 
from undergoing extinction. The net result was a vicious circle in which no 
opportunity was given for the reduction of fear or the elimination of con- 
flict. This, say Dollard and Miller, is the general scheme for disturbances 
in personality and the production of psychosomatic symptoms. 

Dollard and Miller also apply the principles of learning to the process 
of readjustment of maladjusted people. They present an account of the course 
of psychotherapy in one individual and point out how each aspect of psycho- 
therapy works in terms of the principles of learning. They show, for exam- 
ple, how talking about a problem in a permissive and relaxed atmosphere 
could result in extinction of the fear responses and consequent resolution of 
a conflict. They likewise interpret the transference that takes place during 
therapy as a special case of stimulus generalization, and show how dis- 
crimination learning must take place if the individual is to distinguish be- 
tween appropriate and inappropriate behavior in various life situations. 

Maier’s theory. The position advanced by Dollard and Miller really 
needs no new principles to describe the mechanisms in disturbed behavior; 
the learning principles discussed thus far in this book are perfectly adequate. 
We should, however, look briefly at one theory of experimental neurosis 
that does invoke a new principle. This is the theory presented by Maier 
(1949). In experiments by Maier and his associates, rats are required to 
discriminate between two visual stimuli presented on the Lashley jumping 
stand. The stimuli are reinforced and punished in a random manner, so that 
the rats really cannot learn or find a solution to the problem. Animals fre- 
quently refuse to choose in this situation, even though they may be very 
hungry. Thus it is sometimes necessary to “drive” the animals to choose by 
stimulating them with a blast of air (which emits a piercing sound painful 
to rats) or an electric shock. When this happens and the rats are forced to 
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respond, they usually fixate on a particular response — jumping to the left 
card or jumping squarely at the middle, missing both cards. Once this re- 
sponse becomes fixed, it never changes. This Maier has called “abnormal 
fixation.” 

The animals in these experiments, Maier says, are frustrated, and this is 
responsible for the fixation. Animals, he finds, that have developed a fixa- 
tion may even refuse to jump to an open window in which food is displayed. 
Rather, they prefer their “fixated” response. Maier believes, moreover, 
that his experiments show that behavior elicited during a state of frustration 
has certain unique properties. Frustrated behavior is not motivated behav- 
ior in the usual sense; it solves no problems and has no goals. Aggression, 
regression, and fixation are all characteristics of frustrated behavior. Since 
behavioral disturbances can be described in terms of aggression, regression, 
and fixation, Maier believes that a major portion of the disturbances in be- 
havior reported in studies on experimental neurosis are due to frustration. 

The strength of the fixations reported in experiments on the Lashley 
jumping stand is remarkable. Making available the solutions to the problem 
does not break up the fixation (Maier, 1949), and even training the rats 
to walk to the correct stimuli does not eliminate it (Feldman, 1953). Strong 
punishment, persistently administered, however, does seem to break the po- 
sition stereotypes of fixations (Knopfelmacher, 1953). 

Thus Maier introduces a new concept, frustration induced fixation, and 
he, as well as others, has produced a good deal of evidence to support this 
notion. It is an interesting one, and only time and new evidence will tell us 
whether such a concept is necessary or whether all the things it purports 
to explain can be clarified by the ordinary principles of learning. Perhaps 
the greatest importance of the notion, so far as we are concerned at the mo- 
ment, is that it suggests that the laws of adaptive behavior break down under 
the stress of frustration. 

Some comments on experimental neurosis. We can see, from this brief 
survey, that many basic principles in the psychology of learning have been 
put to work answering questions about the causes of disturbed behavior. 
We have also seen that experimental work on animals sometimes leads us 
to formulate new principles which, in turn, may be applied to problems of 
disordered behavior in human beings. 

It should be pointed out that there is a vast gulf between the disturbed 
behavior of animals in the laboratory and the disturbed behavior of indi- 
viduals in trouble. In most studies of experimental neurosis, disruption of 
behavior has been produced by stimuli that elicit strong emotional re- 
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sponses. These disturbances, as well as the emotional behavior, are situa- 
tional. The animals may be disturbed in the laboratory but they are less 
or not at all disturbed when out of the laboratory. Human neuroses are not 
situational. Thus, in the matter of behavioral disturbances the rat has the 
advantage over the human being. The rat does not carry its anxieties around; 
rather its anxieties are conditioned to specific stimuli, and it shows signs of 
anxiety only in the presence of these or similar stimuli. The human being — 
through the mediation of verbal behavior — carries his anxieties about and 
uses implicit verbal behavior to reinforce them. Thus, perhaps the major 
stumbling block to a direct analogy between disturbed behavior in labora- 
tory animals and in human neurotics is the low capacity animals have for 
engaging in symbolic and linguistic behavior. Human beings can much more 
readily generalize and extend their fears and conflicts because they can 
symbolically face fear of punishment or conflict even when the objective 
situation does not demand it. This fact and others make us aware of the 
enormous gulf that exists between experimental neuroses of the laboratory 
and disturbed personalities. We should not ignore, however, the extent to 
which animal behavior in the experimental laboratory can provide us with 
models of basic mechanisms in personality maladjustments. The extraordi- 
nary degree to which the results of simple experiments can be used to 
describe personality processes is well illustrated in Dollard and Miller’s pre- 
viously mentioned Personality and Psychotherapy (1950). Nor have Dol- 
lard and Miller stood alone in this endeavor; Mowrer (1950) and others 
have explored much the same course, and our understanding of basic mech- 
anisms in the dynamics of individual personality has been enriched by 
their work. 

STRESS AND LEARNING 

Before we leave the topic of emotion we have one important problem to 
consider, the effects of emotional activity upon learning and the perform- 
ance of learned skills. Emotional activity arises when the organism is under 
some stress. Examples of stress are familiar enough — the navigator of a 
modern bomber might be troubled by the threat of physical injury and the 

need for quick, precise performance of his task to ensure the success of his 
mission. 

Emotion and Motivation in Stress 

It is obvious that stress involves components other than emotional activ- 
ity. It is a state in which some motive or motives remain ungratified; this 
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may mean failure to avoid or escape threat (fear) or it may even mean 
lack of satisfaction of a powerful appetitive desire (sex, for example). Thus 
stress means heightened motivation as well as heightened emotional activ- 
ity. What is more, in experimental studies of human behavior, it is prac- 
tically impossible to produce emotional disturbance without also producing 
a change in motivation (Lazarus, Deese, and Osier, 1952). Thus, we can- 
not examine the effects of emotional arousal upon learning and skilled per- 
formance without taking into consideration the effects of change in motiva- 
tion. This point was raised in the last chapter in the section on experimental 
studies of human motivation. A number of studies were reviewed in which 
investigators had tried to relate individual differences in motivation level 
to learning and performance on various tasks. The difficulty with these 
studies, remember, was lack of agreement as to whether or not the measures 
of individual differences used by these investigators actually assessed moti- 
vation. Some workers point out that these studies really were concerned 
with the relationship between performance and individual differences in 
emotionality and performance. 

Unfortunately there is no way out of this dilemma at present. Perhaps, 
from a strictly operational point of view, we have no justification for talking 
of the effects of emotion and motivation as being independent of each 
other, at least as far as human skilled performance is concerned. There is 
evidence from other sources, however, that emotional activity and motive 
states do derive from somewhat independent sources, and given the present 
general viewpoint in psychology, we find it convenient to talk about emo- 
tion and motivation separately. 

Roughly stated, level of motivation is responsible for level of perform- 
ance. In other words, higher motivation leads to more adequate performance 
if the latter leads to eventual satisfaction of the motive. Emotional arousal 
provides activity which for all but a small number of skilled acts is likely 
to produce interference. Trembling, gastrointestinal upset, etc., are all 
things that may distract from or directly interfere with some particular task 
an individual is trying to perform. Consequently, in the vast majority of 
specific instances, we should suppose that emotional arousal would have a 
detrimental effect upon skilled performance. 

What, then, of the effects of stress? Stress, as we saw, involves both mo- 
tive and emotional components. One very possibly would bring about im- 
proved performance, and the other, deteriorated performance. What actually 
happens when individuals are stressed in an experimental test and the effects 
of this situation upon performance evaluated? 
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Experinientol Studies of Stress arid Performance 

As the confusing roles of emotion and motivation in stress would lead 
us to imagine, there is no simple way of stating unequivocally that stress 
will have specific effects upon learning and the performance of particular 
skilled acts. There are few stable generalizations that can be made. 

One which seems to receive support from a wide variety of experimental 

investigation is that stress increases the speed with which repetitive tasks 

are carried out (Lindsley, 1946; McKinney, et al., 1951 ). This increase in 

speed is generally accompanied by a higher rate of errors. The net result 

IS that some people actually improve their performance because of the 

greater speed, and others decrease in performance because of an increase 
in errors (Lazarus and Eriksen, 1952). 

Furthermore, we suspect that individual reactions to stress are de- 
termined m part by stable personality patterns. In the last chapter we dis- 
cussed some studies of the relationship between performance on skilled 
tasks and scores on personality inventories. We saw that these personality 
inventories probably assess individual differences in motivational as well as 
emotional factors. Therefore, a thoroughgoing exploration of the relation- 
ship between personality inventory scores and behavior under stress ought 
to provide us with some interesting and analytic data about stress and per- 
sonality. A bare beginning to this study has been made (Deese, Lazarus, 
and Keenan, 1953; Osier, 1954). Perhaps studies of personality charac- 
teristics and behavior under stress will provide a means of separating the 
motivational and emotional components in the reaction to stress. 

There is tentative evidence that poorly organized habits are more suscepti- 
ble to deterioration from stress than weU-established habits. Information 
Items on an intelligence test, for example, are little influenced by stress, but 
the immediate memory span (memory for things heard or seen just once) 

IS susceptible to stress (Moldawsky and Moldawsky, 1952). Also, it seems 
likely that performance early in the learning of a new skill is influenced 
more by stress than is later performance (Deese and Lazarus, 1952). 

Beyond these simple statements it is difificult to go. It is likely that a more 
thorough analysis of the effects of stress upon learning and performance 
will depend upon new experimental techniques and new ways of assessing 
individual differences in emotional reactivity and motivational level. 


CHAPTER 7 


SERIAL LEARNING 


A basic fact of behavior is that it is sequential; acts follow one another in 
a continuous stream. One of the central problems for the psychology of 
learning is to consider how this stream of behavior becomes organized. As 
we try to solve this problem we shall draw upon many basic principles al- 
ready discussed, as well as upon new principles and ways of studying be- 
havior. Therefore, this chapter is about the possible applications of the 
basic principles of instrumental learning to particular facts of serial or 
sequential patterns of behavior, and the new techniques and methods used 
in describing such patterns. 

Many of the examples of learning we looked at in the earlier chapters 
involved sequences of responses rather than single responses. We treated 
these examples, however, as if they were single responses. In the Skinner- 
box experiments, for example, the learned behavior was characterized as a 
“lever-pressing” response, and we treated it as if only one act were necessary 
to carry out the learned instrumental response. Actually, there is an elabo- 
rate chain of responses involved in pressing the lever, a fact pointed out by 
Skinner (1938), The rat must see the lever, and this may lead to lifting the 
paws; it cannot begin the actual depression movement until it feels the 
lever; it must then seize and eat the food. Skinner described at least four 
separate responses as part of the chain involved in pressing the lever, and 
it is probable that a really detailed analysis would yield more. Certainly 
parts of the chain change as learning progresses (Hurwitz, 1954). 

We treated the instrumental activity in the Skinner box, however, as a 
single response, since we were not interested in the parts of the chain of 
behavior and how they became organized. The “response” was defined as 
something which produced an effect upon the environment — moving the 
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lever through a certain excursion. After so defining the response, it was pos- 
sible to study the effects of different variables and to treat these as if they 
were affecting only one response rather than a whole chain of responses. 
Before we can even begin to appreciate the full range of problems in learn- 
ing, however, we must examine the available evidence on the organization 
of response sequences — how responses interact with one another when 
they occur in the stream of behavior. 

In one sense, the larger portion of the remainder of this book is devoted 
to the problem of the organization of response sequences, since interactions 
between responses provide the basis for much of forgetting, problem solving, 
and transfer of training. Therefore, as w-e consider each of these problems 

separately, we shall also be examining certain special effects of response 
sequences. 


SERIAL LEARNING IN ANIMALS 

Let us first look at the problems of serial learning in animals. Since ani- 
mal behavior is less complicated than human behavior and since animals 
have little available to them in the way of symbolic processes or linguistic 
behavior, we shall find applying the basic principles of behavior to serial 
learning in animals relatively simple. Later, when we come to deal with 
human serial learning, we shall examine the new methods and principles. 

Techniques in Studying Serial Learning in Animals 

Everybody knows that psychologists spend a good deal of time teaching 
white rats how to run mazes. This is partly because the maze provides a 
good technique for the comparative study of animal behavior, but also 
because the maze is an important source of information about serial learn- 
ing in animals. There are, of course, other techniques for the study of serial 
processes in animals, but since the maze is the most widely used and im- 
portant technique, let us look at it first. 

The maze. The maze is the tool par excellence for the study of the be- 
havior of rats. The rat’s natural habitat is a mazelike series of burrows, and 
It learns mazes efficiently and well. Consequently, most of the studies of 
maze learning have used rats as subjects. 

There has been an enormous variety of mazes used in experimental 
studies. Figure 28 shows the floor plan for one maze. The rat is placed in 
the box marked S, and it must make its way to the reinforcement in the 
box marked G. At each of the choice points, the rat must choose between a 
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blind alley and the path that will eventually bring it to the goal. An error 
is counted when a rat enters one of the blind alleys. In this particular maze 
the correct sequence of turns is a relatively simple LRLR. The maze can 
be lengthened or shortened by additional choice points, and instead of hav- 
ing these arranged for systematic alternation, they can be arranged ran- 
domly. 

Other mazes can be more haphazardly made, so that, for example, the 
distance between successive choice points is not the same. The famous 



Figure 28. The floor plan of a mulfiple-choice-point maze. The number of choices is probobly 

smaller than would be used in practice. Each entry into a blind alley would contribute one error 
to a trial. 


Hampton Court maze is of this sort. Still other mazes will have more than 
one blind alley at a choice point. Finally, some have alleys leading away 
from the choice point at different angles; a right angle is typical (as in a 
multiple T maze), but more acute angles are frequently used. 

The usual animal maze is an enclosed alley. In some experiments, how- 
ever, the animals are allowed to run on open platforms. In enclosed mazes, 
of course, animals cannot see more than one choice point ahead, but in an 
elevated one, it is possible for the animal to obtain some overview of the 
maze pattern. 

There is obviously a wide variety of mazes, each one suitable for a differ- 
ent problem. One simple maze, the linear maze, has been important in 
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theoretical studies of learning. Figure 29 shows an example of a linear 
maze. It is just a long alley which has been divided into three or four com- 
partments. There are anywhere from two to four doors between each com- 
partment. Only one of the doors will sw'ing open; the others will be locked 
shut. The rat must learn which door is free to open and to run to it. An 
important feature of the linear maze is that the choice points closely re- 
semble one another; consequently, stimulus generalization from one choice 
point to another is likely to be much the same. 

Other techniques. Instead of presenting a maze in which the animal must 
run, it is perfectly feasible to provide a sequence of other kinds of instru- 
mental activity. One experimenter (Arnold. 1947), for example, presented 
rats with a series of buttons w’hich had to be pushed in an invariant order; the 
measure of learning in this case was taken from reaction times, or latencies. 

Frequently a Skinner box is modified so that the rat is presented with 



Figure 29. A lineor maze. There are four choice points with two choices at each. There is a 
centering" door between each choice point to prevent the favoring of one door over the other by 
virtue of the previous correct response. 


two levers instead of one. In some cases the rat must press the two levers 
in some particular order — two presses on the right lever and then two 
presses on the left lever (Schlosberg and Katz, 1943). In other cases the 
object is just to see how the rat distributes its choices between two levers. 
In this connection, some experimenters have studied how the rat distributes 
Its choices between pressing the lever and going to the food tray by putting 
the food tray on the opposite side of the box from the lever (Frick, 1953) 

The particular techniques for studying serial learning will vary with the 
problem being studied and the organism being tested. The maze is the 
instrument of choice with the white rat, because this animal is capable of 
learning an extended series of reactions in the maze. With other organisms, 
other techniques are sometimes preferred, and if one wishes to study the 
natural sequences of behavior, the maze has limited usefulness since it 
forces the animal to perform its choice reactions in an invariant sequence. 

Let us first consider the events that occur when an animal is forced to 
learn in the unchanging pattern imposed by a maze. 
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Theories of Maze Learning 

The original investigations of maze learning (Small, 1899, 1900) were 
made with the idea that the rat does not learn a series of movements but, 
rather, locations in the maze. This common-sense notion that the rat learns 
to recognize places has always been held by some investigators of animal 
behavior, but, for a time at least, the predominant view seemed to be that 
rats learned sequences of movements. Today, we think it rather unim- 
portant whether the rat learns sequences of movements or locations in the 
maze, and we suspect that actually both kinds of learning occur (Blodgett 
and McCutchan, 1947). Historically, however, many important ideas were 
aUied with the notion that rats learn sequences of responses in the maze, 
so let us see how this hypothesis arose. 

Watson s theory of maze learning. Perhaps the earliest attempt to ac- 
count for the facts of maze learning in a systematic way is that of J. B. 
Watson. Watson was one of the great pioneers in the study of learning and 
one of the most important of the early behavior theorists. 

Watson assumed that an animal learning to run a maze is learning a se- 
quence of movements. After Pavlov’s experiments on the conditioned 
salivary response became generally available in this country, Watson as- 
sumed that each of the movements in the sequence was much like a Pav- 
lovian conditioned response. (Notice that he was applying the principles 
of classical conditioning to instrumental learning.) Thus, Watson said, the 
basic process involved in habit was that by which a neutral stimulus comes 
to elicit a response when paired with a stimulus which already has the power 
to elicit the response. Watson’s position in this matter was so important and 
fundamental that we can do no better than look at a quotation from the 
second edition of his famous textbook (Watson, 1924, pages 293-294). 

Certainly at birth or shortly thereafter the elements or units out of which 
every habit is formed can be noted. We mention the contraction and flexion of 
the fingers, or of the lower and upper arm, raising and lowering of the hand, 
rotation of the head, bending the trunk from side to side, the backward and for- 
ward, well-systematized movements of the legs, and a host of others. The conclu- 
sion is forced upon us that in habit no new elementary movements are needed. 
There are enough present at birth and more than will ever be combined into 
complex unitary acts. . , . One needs only to examine the five or six day old 
infant to be reasonably convinced that there is no need for the formation of 
additional reflex arcs to account for all later organization. The new learned ele- 
ment in habit is the tying together or integration of separate movements in such 
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a way as to produce a new unitary activity. And by unitary activity we mean 
nothing more than the everyday acts of life, such as reaching out the hand for an 
object that stimulates the eye, picking the object up and carrying it to the mouth 
or laying it on the table. . . . 

We can define habit then as we did instinct as a complex system of reflexes 
which functions in a serial order when the child or adult is confronted by the 
appropriate stimulus. 

Watson applied these general notions about the development of complex 
acts to the analysis of maze learning in rats — an example of how a whole 
series of discrete movements becomes organized into a unitary act. In a 
complicated maze, Watson thought, the rat must learn a series of responses; 
for example; run 5 feet, turn left, run 3 feet, turn left again, etc. He thought 
of the animal which had learned the maze correctly as having synthesized 
a whole chain of conditioned responses. The ingenious feature of Watson’s 
notion was that the conditioned stimulus for each phase of the sequence of 
acts came from the preceding act. The conditioned stimuli in maze learn- 
ing stemmed primarily from the kinesthetic stimuli of the movements them- 
selves. Thus, the ‘^feel” of each movement was the conditioned cue for the 
next movement. 

Watson (1907) thought he had evidence to show that the main sensa- 
tions rats used in learning mazes were kinesthetic. Unfortunately, it turned 
out that Watson’s experiments were faulty and that kinesthetic cues are not 
necessarily the most important ones for rats in maze learning (see Munn, 
1950). Over and above this fundamental error, there were several other 
critical features against Watson’s notion of maze learning as a simple chain- 
ing of conditioned responses. 

If Watson’s basic idea is correct (aside from whether the conditioned 
stimuli are kinesthetic or not), the whole chain of responses ought to be 
very mechanical; each response should be exactly the same each time and 
follow exactly in its proper sequence. Experimental support for this view was 
offered by Watson (Carr and Watson, 1908). Rats were trained in a maze 
in which the alleys could be lengthened or shortened without changing the 
pattern of turns which the animal had to make. Carr and Watson noticed 
that animals trained on short alleys (a turn required after a relatively short 
straightway) attempted to turn at the same place when the alley was length- 
ened. Thus it looked as though the animals had learned a sequence of move- 
ments in which the nth movement was a turn. Apparently they had learned 
to turn right not when the first opening presented itself but after making a 
certain number of movements. 
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Such observations supported the notion of kinesthetic chaining as the 
basis of maze learning. Subsequent experimental work has shown, how- 
ever, that this cannot be the only, much less the principal, way in which rats 
learn mazes. The number of experimental refutations of this notion is large, 
but perhaps the simplest is provided by Macfarlane (1930). He trained rats 
to run a maze m which the passages contained water to the depth of 8 
inches. The rats were thus forced to swim the maze. He then switched them 
to walking by putting in a false floor slightly below the surface of the water. 
Since there was no increase in errors when the animals were transferred 

from swimming to running, the learned maze pattern did not consist entirely 
of a chain of conditioned movements. 

Watson believed that the most important condition determining learning 
and retention of the correct pathway in mazes was that correct responses were 
generally more frequent and also more recent than incorrect responses 
(Watson, 1914). Watson was not able, however, to account for many of the 
special features of maze learning with these notions of frequency and re- 
cency, and the result has been that more modern theories have tended to 
abandon them in their simple form. 

Hull s analysis of maze learning. Hull, like Watson, has placed great 
emphasis upon the conditioned response. He also indiscriminately applies 
the same principles to classical and instrumental conditioning. Unlike Wat- 
son, however, Hull does not rely exclusively upon movement-produced con- 
ditioned stimuli. The result is that Hull has provided us with a theoretical 
analysis of maze learning that, while defective in certain particulars, has 
been the most powerful analysis made to date. 

Hull (1930) differs from Watson in that he allowed external cues from 
the maze to be conditioned to each response. The visual stimuli at a choice 
point, he said, provide the cues for the response of turning to the right. He 
conceived of this process as being like classical conditioning, but it does 
not do violence to his notions to describe turning as instrumental activity 
brought under the control of a particular discriminative stimulus by differ- 
ential reinforcement. Hull follows Watson to the extent of assuming that 
kinesthetic cues can provide part of the chain that keeps responses in their 
proper order. Moreover, since animals are usually motivated during runs 
through the maze, he said that there is a drive stimulus which is conditioned 
to every response the animal makes (Hull, 1930, 1931); in other words, 
since the animal is equally hungry at every point in the maze, the stimuli 
from drive are conditioned to every part of the maze. 
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An essential element in maze learning, according to Hull (1932) is the 
gradient of reinforcement (Chapter 2). Hull calls this the goal gradient, 
and it refers to the gradient of the effectiveness of reinforcement through the 
maze. In general, it states that correct responses close to the goal will be 
reinforced sooner (and hence performed more adequately) than correct 
responses distant from the goal. Hull (1943) largely attributes this goal 
gradient to a diminishing effect of secondary reinforcement as the animal 
gets further from the goal. The goal gradient, of course, is a temporal gradi- 
ent; it functions as a spatial gradient in the maze only because of the invariant 
order of choice points in the maze. 

One thing the concept of the goal gradient allows Hull to do is deduce 
that the maze will be learned. This is no mean accomplishment; one of the 
real deficiencies of Watson's notions of frequency and recency is that they 
did not unequivocally predict that the rat will learn the maze, a fact pointed 
out by Thorndike (1915). However, it is clear that any time an animal 
makes an error, the reinforcement of that error will be further removed in 
lime than the reinforcement of the correct response at the same choice 
point, for if an animal enters a blind alley it takes it some time to get out. 
The net result is that correct responses will be more strongly reinforced 
than incorrect ones. Another deduction from the goal gradient is that if 
the rat is allowed access to two paths to the goal box, eventually it will 
come to prefer the shorter path. 

A third prediction, one that has not always met with success, is that blind 
alleys near the goal will be eliminated before blind alleys at the beginning 
of the maze. Spence ( 1932) has shown that this is generally true, but there 
are many specific cases in which the backward elimination of errors is ob- 
scured by other factors, such as the direction (relative to the goal) in which 
a particular blind is pointing. 

There is one other mechanism postulated by Hull that is important in the 
analysis of maze learning. This is the jractional anticipatory goal response. 
We have already seen that Hull suggested that a persisting stimulus, prob- 
ably arising from the drive state, is conditioned to every response in the 
maze. It is most strongly conditioned to the goal response, that is, to the 
response of eating food. It is possible, however, that this persisting stimulus 
may prematurely elicit the goal response at some point earlier in the maze. 
This fractional anticipatory goal response allows Hull greater freedom in 
accounting for the data of maze learning. He can, for example, use this 
notion in deriving the prediction that animals learn different pathways in 
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the same maze depending upon whether they are hungry or thirsty (1952). 

Also, this notion leads to the correct prediction that when maze behavior 

IS extinguished, it does not extinguish serially as one might expect, but 
seems to disintegrate all at once. 


The fractional anticipatory goal reaction is much more of an ad hoc 

notion than that of the goal gradient, and sometimes Hull and his students 

have used it too freely in explaining the results of experiments. Hull uses 

this concept to describe much activity which other investigators have usuaUy 
named anticipatory or cognitive. 


Studies on the linear maze. Hull and his students have given a good 

deal of attention to the study of behavior in linear mazes, or more generally, 

to situations in which a series of identical or nearly identical responses are 

required. The linear maze, as we described it earlier, is one in which all 

choice points look exactly alike. These points usually consist of a number 

of doors, only one of which will open into the next compartment of the 
maze. 


Since the choice points in a linear maze look so much alike, there is a 

great deal of stimulus generalization from one to another. Because some of 

the choice points are closer together than others — choice point II is closer 

to III than to IV — some will generalize more to one than to others. Thus, 

if the animal learns to go through the second door at choice point I, it will 

generalize this tendency rather strongly to II, somewhat less to III, and 

much less to IV. Likewise, if the third door is correct on choice point IV, 

the animal will strongly generalize the tendency to go to the third door on 

choice point III; it will generalize this tendency a little less on point II and 
very little on I. 

We can easily see that these tendencies, based upon generalization, lead 
to two kinds of errors at any given choice point. Anticipatory errors are 
those which are incorrect at the choice point at which they occur but cor- 
rect later on. Thus the rat may choose door 1 when it should have chosen 
door 2, but 1 is correct at the next choice point. In this case, the rat has 
anticipated the second choice point. Likewise, perseverative errors are re- 
sponses which are incorrect where they occur but which were correct at 
some earlier choice point. 

The effect of anticipatory and perseverative errors has been studied in 
the linear maze under a wide variety of conditions. In some experiments 
the correct door at each choice is different. In other experiments the same 
door is correct at every choice point. In some experiments the animals were 
reinforced only at the end of the maze; this procedure allows the familiar goal 
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gradient to operate. In other experiments, however, the animals were rein- 
forced after each choice in the series; this produces a radically different 
gradient of reinforcement. 

The results of all of these experiments have been compared with the 
theoretical deductions derived from Hull's general theory of learning. In 
most cases the experimental results and the theory agree. We can see very 
simply and intuitively how the results of these experiments could support 
the theory. 

It is easy to see, for example, that when a different door is correct at each 
choice point, the anticipatory and perseverative tendencies would produce 
nothing but errors. Moreover, since the middle choice points are influenced 
by both ends, the greatest number of errors during learning would be in 
the center of the maze. This is actually the case in experimental results with 
the linear maze (Hull, 1947). If, when a different door is correct at each 
choice point, there is reinforcement only at the end of the maze, there are 
fewer errors at the end of the maze than at the beginning. If each choice is 
reinforced, however, there is much less difference in errors between the two 
end ones, and in actual fact, there are fewer errors at the beginning than 
the end (Hull, 1948). 

If the same door is correct at every choice point, anticipation and per- 
severation are beneficial to learning, for however the animal anticipates or 
perseverates at any choice point, it still makes the correct choice. Since 
perseveration cannot occur at the first choice point and anticipation cannot 
occur at the last one, anticipation and perseveration are greatest in the 
middle; therefore, there are fewer errors in the middle (Sprow, 1947; 
Gladstone, 1948). 

In addition, we have some experiments (Smith, 1949; Woodbury, 1950) 
that show cases in which the choice points are neither all the same nor all 
different. In these experiments, the point at which the maximal number of 
errors occurred was in accord with the hypotheses of Hull. For example, 
if the first three choice points are all the same, but the last one different, 
then the greatest number of errors occurs at the next to the last choice point, 
since the animal anticipates the last choice point most strongly here. 

Remember that anticipation and perseveration are special cases of 
stimulus generalization. The rat in the linear maze, when faced with a choice 
point, will not always be able to tell which compartment it is in. This is a 
failure of discrimination and is therefore stimulus generalization. Hence, 
these studies on the linear maze tell us that the principles derived from a 
study of single instrumental and classical conditioned responses are useful 
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m analyzing a relatively complex form of behavior. These studies show the 
way in which generalization of stimuli associated with particular responses 
may interact to produce a complex pattern of behavior. 

Other serial problems. The analysis of maze learning presented by Hull 
and his associates has been generalized to all sorts of problems. Hull (1952) 
treats m detail sequences of responses in a modified Skinner box and a 
special problem, double-alternation. The important thing about his treat- 
ment is that essentially the same rules apply to these cases as apply to maze 
learning. In other words, it is necessary to consider generalization from one 
response to the next, gradients of reinforcement, and all the other mecha- 
nisms of interaction between responses. The way in which these factors 
combine to produce interaction between responses, of course, depends upon 
the particular problem. The cases which Hull has considered have nearly 
always been simplified so that the interaction between the various factors 
is not too complicated. For example, in the linear maze all choice points are 

alike so that the same amount of stimulus generalization goes from one 
choice point to the next. 

There are, of course, many peculiar features in particular serial acts such 
as maze performance, that cannot be accounted for by mechanisms we have 
been describing thus far. Many of these are peculiar to the rat or even to 
the rat in particular mazes. Consequently, they are not of much general 
interest to the psychology of learning. There is a thorough discussion of 
factors limited to specific cases of maze learning in Munn’s Handbook of 
Psychological Research on the Rat (1950); some of these factors are im- 
portant to the comparative study of behavior. 

Statistical Sequences of Behavior 

In the maze, animals must perform responses in a more or less invariant 
sequence they must first make a choice at the first choice point, then at 
the second, etc. In recent years, however, psychologists have begun to study 
natural behavior sequences in which animals can emit responses in a 
variety of orders. In doing so, they have examined in greater detail some 
examples that we had come to think of as invariant sequences of responses. 

For instance, we have treated behavior in the Skinner box as if it were much 
more uniform than it really is. We have more or less assumed that each 
time the rat presses the lever it immediately turns to the food tray for the 
reinforcement. This is by no means true. Even when the food tray is placed 
directly under the lever, the rat will occasionally press the lever several 
times before eating the food. This may even be the case when the pellet 
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dropped must be eaten before further pellets are released. Interesting 
problems arise when we deal with these naturally occurring sequences of 
behavior, and it will be worth our while to examine them in some detail. 

Statistical behavioristics. First of all, we need a method for analyzing 
the sequential interaction ol responses. Such a method is provided by what 
has been called statistical behavioristics (Miller and Frick, 1949). The 
essence of this method involves computing the probability of a particular 
response occurring after some other response has been emitted. We may 
ask, for example, what is the probability of turning to the food tray after 
lever pressing? Obviously, it is high, though not perfect, because sometimes 
a second lever press will follow the first. What is the probability of return- 
ing to the food tray after eating? It is obviously low, for the rat, rather than 
return to the empty tray, will press the lever again. Thus, it is clear that the 
probability of turning to the food tray does not stand by itself, but depends 
upon what the organism did just before. Thus, we may speak of the de- 
pendent probability of a response, and this, in general, means that the 
probability of a particular response depends upon what preceded that 
response. 

By analyzing the dependent probabilities in sequences of behavior we 
can discover many things. We can, for example, compute an index of stere- 
otypy, which tells us how regular and repetitive patterns of behavior arc. 
Let us look at an example of the application of conditional, or dependent, 
probability analysis to a problem in trial-and-error, or multiple-response, 
behavior. 

Dependent probabilities and trial-and-error behavior. To show how an 
analysis of dependent or conditional probabilities applies to sequences of 
responses in trial-and-error behavior. Miller and Frick (1949) have com- 
pared the behavior of a rat and a child in a multiple-choice experiment. The 
problem for the rat and the child was to get out of an enclosure with four 
doors. Since the door which permitted an exit varied from trial to trial, it 
was useless for the learner to attempt to try again the door which had been 
correct on the previous trial. The original purpose of this experiment, which 
is a part of an older study ( Hamilton, 1916), was to get an extended sample 
of trial-and-error behavior in an insoluble problem. 

Miller and Frick point out that if one attempts to predict which of four 

possible doors the child will pick in any one trial, the best guess is chance 

unless information is available on the relative frequency of previous 
responses and on the response immediately preceding the response in ques- 
tion. Once we have this information, however, we can increase our predic- 
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tion over chance. If, for example, a sequence of two responses in the child 

is considered, we find that of the 16 possible sequences (four doors times 

four doors), 6 sequences appear 80 per cent of the time. Thus a knowledge 

of the choice just before the response greatly increases the accuracy of our 
prediction. 

When Miller and Frick analyzed the data involved in sequences of three 

responses, they found great regularity in the third choice. It was nearly 

always a response which was different from the preceding two. Thus, if the 

child chose doors 1 and 2, there was a very low probability that either doors 

1 or 2 would be chosen as the next response. Miller and Frick point out 

that this is one respect in which the child differs markedly from the rat. 

The rat was much more likely to go back to one of the doors chosen on the 

preceding two responses. This suggests a longer memory span for the child 
than for the rat. 

Dependent probabilities in the Skinner box, A few paragraphs back we 
said that the sequence of lever pressing, then turning to the food tray, and 
then back to the lever was not inevitable in the Skinner box. Sometimes, 
animals press the lever two or three times before going to the food tray. A 
detailed analysis of sequence in the Skinner box has been made (Frick 
and Miller, 1951) by altering the box as shown in Figure 30. The food 
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^ floor plan of a Skinner box used to study sequences of lever pressing and eating. 
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tray is placed on the opposite side of the box from the lever; this makes 
the rat go back and forth across the box. The investigators tabulated the 
number of times the rat pressed the bar and the number of times it ap- 
proached the food tray, as well as the order in which these responses 
occurred. Thus if A means approach to the lever and B means approach 
to the food tray, a typical sequence of responses might be as follows: 
AAAABBBBBBAAAABBAAAAAAABABBBAAA, etc. 

These investigators studied the pattern of responses both before and after 
conditioning. Surprisingly enough, behavior was more stereotyped before 
conditioning than after. Stereotypy in this case usually meant repeating the 
same response in runs. In other words, if the animal approached the food 
tray (empty) before conditioning, it was more likely to repeat this act than 
it was to turn around and approach the lever. After conditioning, however, 
this pattern was broken. There was a tendency to alternate the responses 
(as perhaps the animals should) but this was less regular than the repetitive 
pattern before conditioning. 

In another study, Frick (1953) applied this method of analyzing be- 
havioral sequences to the problem of the gross effects of anxiety or fear 
on behavior. In this case, the patterns of A’s and B's were studied by the 
autocorrelation function. The autocorrelation function tells to what extent 
later responses are dependent upon earlier responses and how many re- 
sponses back this influence may extend. The results of Frick’s study showed 
that a conditioned anxiety response induced by electric shock reduced the 
patterning in the lever-pressing-food-tray sequence without affecting the 
over-all rate of responding. 

These studies represent initial steps in the study of the temporal organiza- 
tion of behavior. It is probable that we shall see many new applications of 
the basic technique of studying dependent probabilities in the near future. 
As we shall now see, these same techniques have been applied to problems 
in human behavior. 


HUMAN VERBAL LEARNING 

The most important and obvious problems in the serial learning of human 
beipgs occur in verbal behavior. Our verbal behavior consists of chains of 
words and phrases emitted in particular orders. These are clearly important, 
since the words do not readily make sense in a scrambled order. Try the 
following sequence on someone: “C will personnel by 20 applications room 
of new for building be positions in the officer received.” The correct se- 
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quence is “Applications for new positions will be received by the personnel 
of verbTl b°r ‘’“f the learning and emission 

our TottZlZ'" “ 

There are Other matters of great importance about verbal behavior, of 
course Words carry meaning, and there are a variety of techniques for 
eva uatmg the role of meaning in verbal behavior, some of which le shall 

?rol Lr H h"' that verbal behavior can be con- 

o led and shaped by reinforcement in much the same way as the behavior 

to c \ \ o (t955), for example, was able 

bv nm? ^ interviewlike situation 

by making plural nouns contingent upon some simple indicator of satisfac- 

t.on on the part of the interviewer (saying “um-hum”). Other studies show 

orlr f h """ controlled by stimulus conditions and rein- 

forcement schedules much as can any other instrumental activity (Buss, 

einer, and Buss, 1954; Verplanck, 1956). While this is important, the 
mere fact that verbal behavior is like any other class of instrumental be- 
havior shows that this kind of control is not important to the unique proper- 
les of verbal responses-their chaining and their meaning. Let us therefore 
turn directly to the serial characteristic of verbal learning. 


Methods in Rote Verbal Learning 

In rote verbal learning we consider the influence of chaining of responses 
upon learning, performance, and retention. Ordinarily, when we are faced 
e tas of learning some new verbal responses such as memorizing a 
poem we sit down with the material in front of us and, by a hit-and-miss 
proce ure, earn it. We seldom memorize a poem by starting atThe begin- 

we repeat lines in small 

groups or skip from one part to the next. 

This hit _ c. ^nd^iss procedure will not do if one is interested in studying 
response organization, or chaining, in verbal learning. In laboratory studies 
of human learning, we are interested in the relationships between the parts of 
the material to be learned and how these relationships determine features 
of learning. In order to study these, it is necessary to hold constant the 
order in which the material is presented to the subject. Thus, in laboratory 
earning, the method of learning is partially controlled by presenting to the 
su^ect only one unit of material at a time in some predetermined order. 

This technique reduces variability in methods of learning among sub- 
jects; that is, it makes our subjects learn more nearly in the same way. 
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Since the technique permits a control of serial relationships in the material 
to be learned, it also makes possible a systematic study of the influence 
these relationships have on learning. The serial-interaction effects are funda- 
mental to verbal behavior, and it is essential that laboratory studies of 
human learning be well controlled in this respect. 

Types of verbal material. As we shall see later, one of the most im- 
portant determiners of rate of verbal learning is the type of verbal material 
learned, and it is important that this be carefully specified in every experi- 
ment. Furthermore, the \’erbal items in a gi\'en chain differ enormously 
from one another in important characteristics. Look, for example, at any 
of the sentences on this page. Such words as “in.” "a," "the.” etc., are obvi- 
ously different from such words as "control," "material,” "methods,” etc. 
Consequently, seldom have investigators of verbal learning used ordinary 
sentences in natural languages as experimental material. Usually psycholo- 
gists have made up their own verbal material so that some of the uncon- 


trolled factors in ordinarv sentences are eliminated or held constant. 

One of the first investigators of human verbal learning, Ebbinghaus 
(1885), devised the most widely used technique for controlling the kind ot 
verbal material used in such experiments. He showed that we can use the 
previously acquired verbal skills of human beings, and at the same time 
eliminate previous learning for the specific verbal material we wash to have 
the subjects learn. This can be accomplished with nonsense syllables. These 
are syllables novel enough to ensure that a uniform minimum of the sub- 
ject’s verbal skills is called upon. Indeed, if our syllables are truly “non- 
sense,” the subject cannot bring into the experiment a large assortment 
of acquired reactions to the specific items he is to learn. 

The nonsense syllable is usually constructed of three letters, most fre- 
quently two consonants w'ith a vowel between them. Thus syl or nep is a 
nonsense syllable. Some of these three-letter nonsense syllables resemble 
English words more than others (for example, compare xyq and get). 
For this reason, several investigators have “calibrated” nonsense syllables 
in terms of their association value (Glaze, 1928; Hull, 1933). Such scaling 
has resulted in classifications of nonsense syllables according to the degree 
of meaning or familiarity. Incidentally, the association value, in terms of 
the number of associates given to nonsense syllables and words, has re- 
cently been itself used as a variable in verbal learning (Noble, 1952b; 
Mandler, 1955). 


Other kinds of material, such as lists of adjectives, etc., have been cali- 
brated in many w'ays. Melton (cited in Hilgard, 1951), for example, has 
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scaled lists of adjectives according to a measure of their degree of re- 
semblance. This kind of calibration is important in the study of generaliza- 
tion or similarity in verbal learning. Various statistical tables (Thorndike 
and Lorge, 1944) giving the relative frequency of occurrence of words in 
ordinary English prose are an important source of controlled verbal material 
for the study of learning. The verbal material an investigator wishes to use 
in a particular study will depend upon the purposes of his study. There is 
now available a wide variety of material, and as new needs arise, new con- 
trolled material will be developed. 

The serial method. In rote verbal learning the method of presenting 



Figure 31. A memory drum. The syllables or words ore printed on the paper behind the shield. 
The drum does not move continuously but moves each item into place and then waits a brief time 
before moving the next one into place. (Courtesy of Ralph Gerbrands.) 

the material is usually controlled in order to force certain kinds of rela- 
tionships between the items in the material. In the serial method the ma- 
terial is forced into a rigid sequence. A memory drum (see Figure 31) is 
usually employed to achieve this order. The items to be learned (nonsense 
syllables, for example) are exposed to the subject one at a time in the 
small window in the memory drum. These items are usually exposed at a 
constant rate, so that the same amount of time is spent on each item. The 
subject is instructed to anticipate each syllable before it appears in the win- 
dow. There is sometimes a blank space between each syllable, which allows 
the subject time to anticipate. Usually the subject is instructed to spell 
rather than say the syllables, if nonsense syllables are used, in order to avoid 
pronunciation difficulties. 


SERIAL LEARNING 

In the serial method, each item serves in turn as a stimulus and as a 
response. For example, the first item in a list might be the nonsense syllable 
SYJ. When the list begins on each trial, the stimulus for this syllable is pro- 
vided by the symbol used to indicate the beginning of the list. The next 
syllable might be loz. syj is the stimulus which tells the subject that Loz is 
coming up. loz in turn might be followed by nep. Thus, syj is the stimulus 
for the response “l-o-z.” loz is the stimulus in turn for the response 

“n-e-p.” 

Table 1 shows the results of an experiment on rote verbal learning with 
one subject. In the columns are given the 12 syllables in the list. The rows 
represent the trials, and there were 20 given to this subject. Note that a trial 
is defined as one repetition of the entire list. The plus signs represent correct 
anticipations, and the zeros represent incorrect responses or no responses 
at all. In some experiments, incidentally, we may be interested in recording 
the exact errors made by the subject. The first time through the list, of 
course, the subject can get no responses right, since this is the first time he 
has seen the material. Notice that this particular subject guessed at the first 
syllable on the second trial and got it right. On the third trial he correctly 
anticipated the first syllable, the second syllable, and the last syllable. On 
the nineteenth and twentieth trials the subject anticipated all the syllables 

correctly. 

If we are interested in studying the retention of verbal material, some 
time after the original learning we may give a test of retention. Such a test 
could be obtained in exactly the same w'ay the learning data were obtained, 
the subject is required to anticipate each syllable in turn. On other occa- 
sions, we may be interested in a free-recall test; the subject is simply asked 
to recall as many of the syllables as possible in their correct order. 

The paired-associates method. For many purposes there is one grave 
defect in the serial method — each item serves as a stimulus and a response 
in turn. Sometimes it is important to sort out and independently vary the 
stimulus and response aspects of a task to be learned. This cannot be done 
with the serial method; consequently, a modification known as the method 
of paired associates has been developed. 

In this method, as its name implies, pairs of items are presented to the 
learner. He is instructed to learn the pairs in such a way that when the first 
member of the pair appears, he is able to recall the second. Thus the 
first member of a pair is a stimulus item and the second member a response 
item. Unfortunately, in this method, the serial order of presentation cannot 
be retained, since if it were, subjects could make a connection between re- 
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sponse items and the stimulus items which follow. Therefore, in this method, 
the pairs of stimulus and response items are presented in some random or 
prearranged order which is different for every trial. Obviously, the method 
of paired associates destroys the simple serial characteristics of rote learn- 
ing, but it enables the investigator to distinguish between stimulus material 
and response material. For some problems the simple anticipation method is 
more suitable; for others the method of paired associates provides the most 
useful technique. The uses of this method will be described in later chapters. 

Associations in Rote Verbal Learning 

One of the fundamental problems is the degree to which different items 
in a verbal-learning task are associated with one another. In the method of 
serial learning we know that successive items are associated, since otherwise 
the learner would not be able to anticipate. It turns out, however, that 
learners also make connections between items that do not adjoin. These are 
called remote associations. 

The nature oj remote associations. In a typical experiment, nonsense 
syllable A becomes a cue or stimulus for the response of nonsense syllable 

B. B is a cue for the response C, etc. The subject learns the list in order. 
In addition, however, syllable A is connected to some degree with syllable 

C, so that A will occasionally give rise to the response C. Such responses, 
of course, are analogous to anticipatory errors found in the studies of the 
linear maze. In verbal learning these anticipatory responses are called 

remote forward associations. 

Since perseverative errors are found in the linear maze, it might be sup- 
posed that the same errors would be found in rote verbal learning as well. 
Indeed, they are. If the learner responds with A when he should have re- 
sponded with D, a perseverative error has occurred. In verbal learning, 
such errors are called remote backward associations. 

Ebbinghaus (1885) was the first to detect these remote associations 
through a technique known as the method of derived lists. First he learned a 
list of nonsense syllables — Ebbinghaus always used himself alone as a sub- 
ject — then learned a second list obtained by systematically rearranging the 
first list. Sometimes in the rearrangement every other item was made ad- 
jacent, sometimes every third item, and sometimes every fourth. Ebbing- 
haus’s results showed that he could relearn systematically rearranged lists 
much more quickly than he could relearn ones haphazardly rearranged or 
an entirely new list. Furthermore the fewer the number of items skipped in 
the rearranged lists, the quicker the time of learning. From this- Ebbinghaus 
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inferred that remote (forward) associations had helped him in the learning 

of the rearranged lists, and therefore that these associations must have been 
learned in the original lists. 

Ebbinghaus s results tell us little about the specific remote associations 

formed. Nor finally, does his method indicate the frequency of remote asso- 
ciations. 

A method used by McGeoch (1936) tells us a good deal more about the 
exact nature of remote associations. He asked subjects to practice, by the 
method of anticipation, memorizing a series of nonsense syllables. He 
stopped the practice before the subjects had learned to anticipate perfectly 



Figure 32. Number of associations as a function of the degree of remoteness (syllables spanned). 
{Data from McGeoch, 1936.) 


and then gave them a free-association test with the nonsense syllables they 
had been practicing as stimuli. That is, he read in a scrambled order the non- 
sense syllables the subjects had been learning and asked them to respond 
with the first syllable which came to mind. Figure 32 shows the frequency 
of association as a function of degree of remoteness. Notice that the number 
of remote associations, both forward and backward, falls off abruptly after 
zero degrees of remoteness; in addition, it appears that forward associations 
are slightly more numerous than backward associations, at least at the first 
degree of remoteness. 

Interpretations of remote associations. We shall deal with the signifi- 
cance of remote associations later, but now let us look at some of the no- 
tions about how remote associations are brought about. One idea is that the 
earlier items are still being responded to when the later items appear. Thus 
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there is an association between a trace of the earlier item and the occurrence 
of the later one. This theory has been greatly elaborated by Hull and his 
associates (Hull, et al., 1940) and used to explain many characteristic phe- 
nomena of rote serial learning. 

Another theory interprets remote associations as mediated connections 
between two items in a list and a third item common to them. The medi- 
ating factor might be something within the subject himself or some constant 
feature of the environment (the context) to which the specific items on 
the list become connected (McGeoch, 1942). There is experimental evi- 
dence that mediated associations can produce learning. Suppose, in a 
paired-associate experiment, we ask subjects to associate nonsense syllables 
to common words, so that given the nonsense syllable, the subject can recall 
the appropriate word. For example, given cef, the subject would learn to 
say “stem.” Now “stem” is a strong eliciting stimulus, in a free-association 
test, for the word “flower.” And “flower,” in turn, is a strong eliciting stimu- 
lus for “smell,” but “stem” does not tend to elicit “smell” directly. If we 
now take a series of such chains and ask the subjects to learn the third mem- 
ber of the chain (smell) to the original nonsense syllables, apparently they 
learn more rapidly than they would learn a list of unrelated words (Russell 
and Storms, 1955). In other words, the associative habits built into the 
subjects before the experiment result in the mediation, or transfer, of the 
pairing of the same nonsense syllables from one list of words to another. 
This, of course, tells us nothing about how such mediational activity comes 
about, but it does suggest that associations between two items arise from 
the mediation of chains of intervening associations. 

Remote forward and backward associations in rote verbal learning may 
have more than a superficial similarity to anticipatory and perseverative 
errors in linear maze learning. Anticipatory and perseverative errors are at- 
tributable to stimulus generalization. It is difficult, however, to see how 
simple gradients of stimulus generalization in verbal learning could be based 
exclusively upon resemblances among the items themselves. Such similarities 
would not always produce gradients of remote associations, since the re- 
semblances might be distributed in a chance manner throughout a list. It is 
possible, however, that the context in which a particular item is to be emitted 
allows for stimulus generalization (see Schulz, 1955). If, for example, in a 
list of nonsense syllables Dox is followed by cet, dox and the context pro- 
vided by having gotten through so many items in the list elicit cet. Suppose, 
however, that the context generalizes to some later point in the list, dox might 
then elicit some other response, say zym. In intuitive terms this would happen 
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when the subject had “lost his place” in the list. It is even possible that such 

contextual generalization might be aided by random resemblances among the 

syllables themselves, as perhaps would be the case if doq preceded zym 
in the list. 


If the notion that contextual and item generalizations are responsible for 

remote forward and backward associations is correct, then the number of 

remote associations ought to increase directly as the similarity between 

Items in a list is increased. Although there is no direct evidence that this is 

so, there is some indirect evidence. Underwood and Richardson (1956) 

find, for example, that rate of learning is a function of the similarity of 

Items within a list. The higher the similarity, the lower the rate of learning. 

This means that the greater the generalization between items on a list the 

slower the learning, and, by implication, this lower rate of learning is the 

result of a greater number of remote associations intruding at the wrong 
point in the list. 


The Serial-position Effect 

For a long time students of learning have been aware that there must be 
an intimate connection between remote associations and other characteristic 
phenomena of rote verbal learning. One of these is the serial-position effect. 
This effect refers to the fact that the rate at which a given item in a serial 
list of Items is learned depends upon the location of that item within the 
list. When learning is by the serial method, the beginning of a list of non- 
sense syllables is learned most quickly and the end of the list is next easiest 
to learn. For this reason, the serial-position effect has sometimes been called 
the primacy-finality effect, to emphasize this point. 

Serial position in rote serial learning. We can see what the serial-posi- 
tion effect looks like by examining the curve in Figure 33. This curve is 
taken from an experiment on the memorization of nonsense syllables by the 
serial method (Hovland, 1938b). The total number of errors or failures of 
anticipation at each position in the list is shown on this curve. The greatest 

number of errors occurred at the seventh syllable position, which is just 
past the middle of the list. 

The shape of this curve has wide generality. The most difficult part of a 
list of serial items learned by anticipation is just past the middle of the list. 
As more items are added, the point of maximum errors always remains in 
the same relative position (Hovland, 1940). The serial-position curve is 
essentially the same for meaningful words as it is for nonsense syllable'^, 
though, of course, the total number of errors is less in the former case (^ 
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Crary and Hunter, 1953). If one compares early performance with late 
performance in learning, of course, the serial-position curve is flatter in the 
later case, but this is simply because all of the items are finally learned 

(Ward, 1937). 

Explanation of the serial-position effect. The best-known attempt to 
explain the serial-position effect for anticipatory rote learning is that of 
Hull (1935). Hull argued that remote forward associations inhibited im- 
mediate associations between them. For example, a remote association be- 



Figure 33. The number of errors ot each position in a list of nonsense syllables during the learning 
of the list by serial anticipation. (Hovland, 1938.) 

tween items 1 and 9 would inhibit or tend to reduce the tendency for all 
intervening direct associations to occur. Consequently, the items in the mid- 
dle of the list should be inhibited in proportion to the number of delayed 
associations which span them. This notion is illustrated in Figure 34 (Hull, 
1935). Notice that the greatest number of delayed associations bridges the 
middle items, and this is a way of representing the notion that the most 
difficult items to learn should be in the middle. 

There are two things wrong with Hull’s hypothesis. First of all, it ignores 
remote backward associations; this is important because remote forward 
and backward associations are not perfectly symmetrical. Secondly, it ig- 
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associations spanning thot portion. {Hull, 1935.) ' ” Pos'»*on is proportional to the number of 

'earn in actual 

in« ^ut 2 'It ^ ■" " rote learn ’ 

«a!ded as T complicated that they have been re- 

garaed as rather unconvincing. 

Bugelski (1950) pointed out that remote associations could much more 
vmcmgly account for the serial-position effect if we simply assume that 
ey interfere with the emission of correct responses. This would place the 

ski’! ana‘lvr''''r' ''''' while Bugel- 

it d!es not 3 serial-position curve. 

It does not produce as much as actually exists in practice 

ing ffiniSr" " •!™"’ primary factor in produc- 

coun^L the f ‘hi^ directly by 

the usual metho *!! ° responses at each position. Unfortunately, 

does no!^ presenting material in serial-anticipation experiments 

does not permit this to be done. The syllables are paced at a moderately 

intrusions**Th°^^ ° simply failures of response, rather than 

ntrusions. This may be because the subject takes too long in getting the er- 

Taraor^T n" next syllable 

thev XlTd r'" (1952) performed an experiment in which 

WhL this • l°r the subject to respond between syllables. 

other itel f associations (intrusions of 

miHHl f fh 7"^ almost symmetrically distributed about the 

to just* past th '** • displacement of maximum difficulty from the middle 

!bWf T explained by the fact that, as the list proceeds, the 

bjects are simply unable to respond with even a reasonable guess. Thus it 
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appears as though the serial-position effect is composed of two components, 
an almost symmetrical component of overt intrusions or remote associations, 
and an increasing tendency through the list to be unable even to guess. 

Though the tendency to make intrusive errors based upon remote associa- 
tions accounts for the bowed shape of the serial-position curve for rote 
serial learning, it does not account for the skewing, or displacement, of the 
cur'e. This displacement is probably the result of a combination of several 
factors including memory span and perhaps something like the reactive in- 
hibition factor discussed in the chapter on extinction. 

The seriaTposition curve in free recall. Suppose that a list of nonsense 
syllables or randomly arranged words is presented to a subject, and, instead 
of recall by serial anticipation, the subject is allowed to recall the words in 
any order he chooses. Under these conditions the most frequently recalled 
words are at the end of the list (Welch and Burnett, 1924; Deese and Kauf- 
man, 1957). Two sample serial-position curves for such free recall are pre- 
sented in Figure 35. (Note that these curves are plotted in correct responses 
rather than errors.) Thus it appears that the serial-position curves for serial 
anticipation and for free recall are roughly the reverse of one another. In 



Figure 35. Serial-position curves for free recall. In contrast to the curve in Figure 34, these curves 
ore presented in terms of the number of items correctly recalled. The highest frequency of recall is 
a! the end of the list. (Deese ond Kaufman, 1957.) 
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one, the greatest number of correctly recalled items is at the beginning of the 
list, and in the other, the greatest number is at the end. 

It turns out that in free recall the greatest determinant of the probability 

o recall (as well as the order of items in recall) is the associative strength 

ot the Item (Bousfield, Cohen, and Silva, 1956). Thus in the immediate free 

reca 1 of randomly arranged items it is probable that the last few items in 

e ist are strongest, perhaps because of the persistence of an image of 

them ; therefore, these items are recalled first and have the greatest probabil- 
ity of being recalled. 


Something quite different seems to happen with the free recall of con- 
nected prose, however. If brief paragraphs of English are used as learning 
material and recall is unrestricted, the resulting serial-position curve looks 
more like that of the method of anticipation rather than that of the free 
recall of random items (Deese and Kaufman, 1957). This is, of course, 
because the recall of connected material is not really free; the serial charac- 
teristics of ordinary prose force the subject to start from the beginning and 
go through to the end, at least if he is to take advantage of the redundancy 
(see below) of language to maximize his recall. Thus, free recall of unor- 
ganized material is markedly different from that of highly organized ma- 
terial. A systematic exploration of this difference shows that as more and 
more organization is introduced into a list of words, the characteristic of the 
serial-position curve changes gradually from that of the free recall of ran- 
domly arranged items to that of recall by serial anticipation (Deese and 
Kaufman, 1957). This result seriously suggests that the method of serial 
andcipation is much more like the recall processes used for ordinary lin- 
guistic material than we had heretofore supposed. 


Statistical Organization of Verbal Material 

In the last few paragraphs we have discussed the question of organiza- 
tion in verbal material without really saying what we mean by “organiza- 
tion. In this section we shall discuss the meaning of verbal organization of 
a particular kind — statistical organization — and how it affects the sequen- 
tial learning of verbal material. Statistical organization involves taking ac- 
count of dependent probabilities between items in exactly the same way 
that dependent probabilities have been applied to multiple-choice behavior. 
Since earlier in this chapter we explored this problem, we are already 
familiar with the general concepts. 

A moment’s thought will make it clear that dependent probabilities apply 
to the elements of languages. Take the occurrence of letters in written Eng- 
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iish, for example. Everybody knows from Poe's The Gold Bug that letters 
in English do not occur with equal frequency. The letter e appears more 
frequently than any other letter; consequently, if we were to guess that the 
letter e occurs in a given sequence of letters rather than simply picking a 
letter at random, we should have a better chance of being right. Likew-ise, the 
occurrence of a given letter is influenced by what has gone before. Therefore 
we can use our knowledge of what has preceded in a particular passage to 
predict what is going to happen next. In English, the letter q is always fol- 
lowed by u \ thus the letter immediately after q is perfectly predictable. It is, 
to use the term frequently used in such analyses, an example of perfect re- 
dundancy the u tells us nothing new, and if it w'eren't for habitual usage, 

might just as well be left out. There are associations between all other letter 
combinations in English, though not quite so strong as that between q and u. 
The letter t, for example, is more likely to be followed by h than by m. 

It is easy to see, furthermore, that the probability of occurrence ot a 
given letter will be influenced by the letter which precedes it by two or three 
spaces and more. In English, for example, two consonants are more likely 
to be followed by a vowel. The probability of occurrence of a given letter in 
English, as a matter of fact, is influenced to a significant degree by letters as 
many as seven positions in front of it (Newman and Gerstman, 1952). 

The method of actually calculating the probabilities involved in such or- 
ganized sequences of events in language derives from the mathematical 
theory of communication or information. Information theory, as it is usually 
called, has had important methodological implications for the study of 
human language and learning. We shall not deal with the theory directly in 
this book, although it will indirectly enter into many of the problems of lan- 
guage learning that we shall face. A general introduction to information 
theory is given in Miller’s Language and Communication (1951). For now, 
however, let us simply accept the fact of dependent probabilities and the 
resulting statistical organization in language, and examine some of its impli- 
cations in the serial learning of linguistic material. 

Statistical organization and immediate free recall. A textbook on the 
psychology of learning is hardly needed to tell one that nonsense material 
is harder to learn and recall than connected, logical material. Some experi- 
mental studies show us, however, that it is not necessarily the sensibleness 
of connected material that is primarily responsible for this. 

Miller and Selfridge (1950) show how gradual approximation to the 
statistical structure of ordinary language influences the amount of material 
recalled. An order of approximation to sensible English may be defined by 
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levels of H 7 ^ constructing verbal material with various 

levels of dependent probabilities without going to the enormous labor of 

Z^f probabilities involved. If we select words at ran- 

dom from the dictionary or a word list, we should have a zero-order approxi- 

mation to English. An example of a zero-order sequence might be: “pack 

wouM^?°t^ H hypermetrical gloat.” A first-order approximation 
would select words according to their relative frequencies in English usage. 

Thus common words like “the” and “before” would most likely show up. 
If we gave an individual a word to begin a sentence with and gave the sec- 
ond word of his sentence to another individual for him to start a sentence 

T" a second-order approximation 

from the resulting list of words. An example of a second-order approxima- 

lon taken from Miller and Selfridge’s paper looks like this: “was he went 
to the newspaper is in deep and.” For higher-order approximations, indi- 
viduals may be given sequences of words to use. In these cases, they add a 
word directly after the sequence and drop the first word of the sequence. 
This new sequence is then given to another individual, etc. The words im- 
mediately following the sequence of given words provide us with a higher- 
or er ist. An example of a seventh-order approximation obtained in this 
manner looks like this: “then go ahead and do it if possible while I make 
an appointment I want to skip very much around the tree and back home 
again to eat dinner after the movie early so that we could get lunch because 
we liked her method for sewing blouses and shirts is.” 

We can see that the seventh-order approximation appears to make vague 
sense— a sort of stream-of-consciousness sense. ‘This is because it preserves 
the ordinary dependencies of speech without having anything in particular 
to say. Miller and Selfridge showed that it is this chain that determines how 
much material a person remembers. Up to a point, the higher the order of 
dependency, the easier it is to remember. With adult subjects. Miller and 
Selfridge found little or no difference in the recall scores for a seventh-order 
approximation and good, sensible English. Thus, even though the seventh- 
order approximation is essentially nonsense, it is recalled as well as the 
sensible material in a test of immediate free recall. 

A summary of Miller and Selfridge’s results is given in Figure 36. This 
shows the relationship between order of approximation and per cent of 
words correctly recalled. Notice that above fifth-order approximations there 
seems to be little improvement in recall. This apparently is because this 
and higher approximations preserve the short-range dependencies which 
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the individual can remember. Thus it appears that to some extent meaning- 
ful material is more easily learned because it preserves the short-range 

associations of ordinary language. 

As we mentioned earlier, introducing statistical organization also changes 
the form of the serial-position curve. Deese and Kaufman (1957) studied 
the serial-position curves that resulted from the immediate free recall of 
lists of words with different orders of approximation, introducing the or- 
ganization into the material presented for free recall gradually changes the 
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Figure 36. Per cent of words correctly recalled as a function of the order of approximation to the 
statisticol structure of English. (Mfl/er ond Se/fridge, 1950.) 

characteristics of the serial-position curve from those of recall of unorgan- 
ized material to those of serial anticipation. This seems to be additional evi- 
dence that the statistical organization preserves the short-range anticipa- 
tory associations in ordinary language. It should be emphasized that statis- 
tical dependencies in language do not account for all the greater ease of 
recall for meaningful material; some of it must be due to the context derived 
from the content of the material (Marks and Jack, 1952). 

Recall and organization in nonsense syllables. In another approach to 
much the same kind of problem (Aborn and Rubenstein, 1952; Rubenstein 
and Abom, 1954) a group of subjects was made familiar with a set of 
nonsense syllables. These subjects then learned combinations of these non- 
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svlTallef ' T l' H patterning introduced among the 

1 m condition they learned random arrangements of the 

syllables under other conditions they learned that there were to be patterns 

among the syllables and were instructed in the actual nature of these patterns. 

selves It w'"^ I ^ Patterns, they memorized the syllables them- 

selves. It was clear that learning the organizational rules beforehand enabled 

hese subjects to remember more individual syllables. However, interestingly 

enough they did not profit from the patterns as much as they could have In 

terms of amount of information (which takes into account the existence of 

the patterns) they actually remembered less with highly organized than with 

random patterns. While it is possible that this is merely because they did not 

ave enough practice with the patterns, it suggests that perhaps people, when 

learning, do not take as much advantage of statistical dependencies as they 
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CHAPTER 8 


THE LEARNING OF SKILLS 
AND CONDITIONS OF PRACTICE 

Most of us have at our command an enormous range of skills we have 
patiently acquired over the years. Some of these skills are highly specialized 
and some are fairly universal within a given culture. Most American adults, 
for example, seem to be able to drive an automobile, while only a few of 
us can operate a machine lathe. The psychologist's interests in these skills 
are many and varied. Some psychological problems arise, for example, 
when we ask about the pattern of native abilities that goes into any given 
skill. Other psychological problems come up when we ask about the inter- 
actions of these skills — how an individual may be hindered or helped in 
learning a new skill by previously acquired skills. In this chapter we shall 
} be concerned primarily with the ways of experimentally analyzing the learn- 

ing of complex skills and how such learning is influenced by the methods 
and conditions of practice. We shall largely ignore the question of abilities 
and defer to later chapters the question of interaction among separate skills. 

LEARNING CURVES AND SEQUENCES OF SKILLS 

The first thing to occur to anyone given the task of finding out how 
human beings acquire skills would be to chart the results of practice at the 
task in question. Psychologists spent a great deal of time doing this in the 

* early experimental studies of the learning of human skills. Often as much as 

* a third of an early textbook on the psychology of learning was devoted to 
the measurement of improvement with practice and plotting this improve- 
ment in learning curves. 

There is sometimes much to be gained by this practice. Frequently, in 
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remedial-reading courses or in special-training courses in reading, the in- 
structor recommends that students keep records of their daily reading rates 
and plot these on a chart. There seems good reason to suppose that this is 
a useful teaching device for some skills. Sometimes learning curves are used 

in industry for monetary incentive scales and for evaluation of the success 
of particular industrial training programs. 

In addition to these more or less practical uses, however, it has some- 
times been thought that learning curves would tell us something funda- 
mental about the learning process itself — indeed, perhaps that there was a 
generalized lav/ about the learning of skills. Unfortunately, such hopes 
about learning curves have never quite been realized. However, there are 
general characteristics of some learning curves that are important. 

A theoretical derivation of a learning curve. Thurstone (1930a) pre- 
sented a theoretical derivation of a learning curve based on the assump- 
tion that all acts could be classified into successful components of skills 
and errors which will not lead to successful completion of the act. From 
this and simple assumptions about rate of error elimination Thurstone de- 
rived an equation that describes a learning curve. This curve can represent 
the acquisition of a remarkably wide class of skills. For example, at least 
one aspect of the learning of typewriting can be described by this equation, 
since the results of practice at typing can easily be classified into successes 
and errors. The derivation, however, would yield no information about a 
curve describing the effects of practice on speed of typing. 

One interesting thing about Thurstone’s derivation is that it contains 
some psychologically meaningful numerical constants. There is one, for ex- 
ample, which concerns the rate of error elimination. If the constant has the 
value of zero, no errors will be eliminated because the learner will not profit 
by the performance of an erroneous response, and its probability of reoccur- 
rence will remain unchanged. If, however, this constant has the value of 1.0, 
each time an error is performed its probability of repetition is reduced to 0. 
Thus, what is commonly called trial-and-error behavior can be described as 
happening whenever this error constant is between 0 and 1.0. If this con- 
stant becomes greater than 1.0, however, it means that the occurrence of 
an error eliminates not only this particular erroneous response, but the 
learner profits enough by this mistake to be able to eliminate possible fu- 
ture occurrences of other errors. This, says Thurstone, is the essence of 
rational learning. If a learner can profit by error to the extent of generaliz- 
ing to other possible errors, it certainly is more than trial-and-error learning. 

The possibilities inherent in Thurstone’s analysis of learning curves have 
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never been fully realized. We should note, however, that Thurstone’s anal- 
ysis does not cover all learning curves; it is meant only for those in which 
the comp>onents of performance can be plotted in terms of proportion of 
correct or erroneous responses. Even here the agreement with data is not 
as good as with other theories. 

Types of learning curves. There are many ways in which components 
of skills can be measured. For skills in which the components can be classi- 
fied into errors and correct responses, curves can be plotted which show the 
way in which errors decrease with practice (trials) or the way in which 
correct responses increase. In addition we can plot curves which show the 
length of time it takes to complete a unit of practice at the task. This is the 
kind of curve one would obtain if the length of time it took an individual 
to assemble a jigsaw puzzle were measured. Obviously, time per trial de- 
creases with practice, or we do not have evidence for learning. If, for exam- 
ple, the same individual assembled the same jigsaw puzzle several times in 
succession and there was no decrease in time of assembly, we should con- 
clude that the individual had not shown evidence of learning to assemble 
the puzzle. 

In many tasks we can obtain measures based upon both errors and time. 
In maze learning, for example, we can plot both the number of blind alleys 
a subject enters on a given trial and the time that it takes him to go through 
the maze. 

In addition, in some tasks, we may be interested in plotting the way in 
which accuracy changes with an increase in the number of trials. If we were 
studying the ability of an individual to learn to fire at a stationary target, 
for example, we might plot a learning curve in terms of his deviation from 
the center of the target for each trial. Or, if an individual were tracking a 
constantly moving target, we might plot the amount of time he stayed on 
the target for each trial. 

It should be noted that the measures of improvement are not all neces- 
sarily independent of one another. In maze learning, for example, number 
of errors and time per trial will be highly correlated. This is simply because 
the fewer the entrances into blind alleys, the less time it takes to go through 
the maze. There are fewer mechanical correlations as well. In general, for 
example, an increase in accuracy will go along with a decrease in the 
amount of time necessary to accomplish a unit of work. In other words, 
improvement in one aspect of a task is usually accompanied by improve- 
ment in other aspects as well. This is not necessarily so, but it is generally 
true of learning outside of the laboratory. 
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In addition there are many other changes in behavior during learning which 
often do not get charted. Usually there is a change in attitude of the indi- 
vidual toward the task. A task may be regarded as difficult at the outset of 
learning and as easy after much practice. There may be a change in the 
degree of tension which an individual shows during learning. In the begin- 
ning there may be much tension and anxiety which may be reduced by the 
learning of a task. Finally the mode of attack of the learner will most surely 
vary throughout the course of learning. Thus it is clear that there are many 

aspects of the change in performance with practice, and some of these are 
often ignored. 

Many times learning curves are obtained by averaging together the per- 
formance of a number of individuals at each trial. The learning curve then 
IS a kind of average learning curve. ‘‘Kind of average” is sometimes 
apt, for there are frequently serious distortions introduced by this proce- 
dure. This has been pointed out many times, most recently by Estes (1956). 
Most generally this is true because the form of the curve for averaged 
measurements is not necessarily the same as that for the individual subjects. 
For example, suppose we plotted learning curves for a group of individuals 
who always learned instantaneously (as would be the case if the learners 
had insight ). Thus on some one trial an individual’s score would go from 
zero to the maximum possible score. If some of these individuals learned at 
different trials, the result could be a smooth, gradual learning curve for the 
average of all these individuals. If we looked only at the average curve we 
should infer that the learning of this task proceeded in a gradual way, with, 
perhaps, more learning in the early trials than in the later trials.^ Thus, be- 
cause learning curves for individuals are not always the same as those for 
groups, we must be careful about the inferences we draw. 

Plateaus. If we look at learning curves for individuals an interesting 
phenomenon occasionally turns up. This is illustrated by the curve in Fig- 
ure 37. The task this individual was practicing was receiving telegraphic 
code. Notice that there was a period of about six weeks during which the 
subject did not appreciably increase his ability to receive code signals. 
After this period, there was a sudden spurt in letters received per minute 
which gradually leveled off to a final asymptote. Such a period of little or 
no improvement is known as a plateau. 

The particular curve in Figure 37 comes from a well-known older study 
by Bryan and Harter (1897, 1899), who were trying to determine the 
reason for this plateau, which they thought fairly typical of learning tasks 

1 This would be true if the distribution of occurrences of “insight” were logarithmic. 
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like receiving code. They put forward the notion that a plateau occurs when 
an individual has learned all of the primary skills in a complicated task 
but has not yet organized them into broader units so that he can learn the 
higher-order skills. They argued that learning any skill like telegraphy in- 
volves a hierarchy of habits; that is to say, the learning of the higher-order 
skills depends upon mastery of the lower-order skills. In learning to receive 
telegraphic code, for example, individuals first learn to receive individual 
letters. Then as they master the skill involved in this, they can take advan- 
tage of the redundancy, or predictability, in English and learn to receive 
by words instead of by letters. Finally, they can receive phrases or words 
as units. 



WEEKS OF PRACTICE 

Figure 37. Learning curve for receiving telegraphic code. A plateau occurs in the middle of the 
curve just before the minimum rate is reached. (After Bryan and Harier, 1897.) 

Bryan and Harter argued that the plateau ought to occur at the transi- 
tion between learning to receive individual letters and learning to receive 
words. They were able to justify this interpretation by a study of the way 
in which people learned to receive jumbled letters rather than real words. 
They found that the plateau for meaningful material occurred at just about 
the point at which there was no further improvement in the ability to receive 
disconnected letters and words. Thus, the plateau occurs where the limit 
on the lower-order habits has been reached and where the higher-order 
habits have not yet begun to appear. The results of this comparison of dis- 
connected and connected material can be seen in Figure 38. 

It is obvious that the notion of hierarchy of habits is useful whether we 
are interested in plateaus or not. Something like this hierarchical structure 
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must occur in all tasks in which there is opportunity to recode the material 

into larger and larger units. Thus, learning to typewrite, read music, and the 

basic skill of learning to read itself must involve something like hierarchies 
of habits. 

Plateaus do not always occur. Indeed, in telegraphy they seem to be the 
exception rather than the rule (Taylor, 1943). The transition between or- 
ders of skills is more often a smooth one. Also, there seem to be other causes 



Figure 38. Learning curves for receiving connected discourse, disconnected words, ond discon- 
nected letters. Notice that the plateou in the curve for connected discourse occurs of the point where 
there is no further improvement in the curves for disconnected material. (After Bryan and Harter, 
1899.) 

of plateaus in the learning curve. Swift (1918) argued that plateaus are 
caused by a decline in the learner’s motivation. He pointed out that pla- 
teaus for all learners occur in about the same place; furthermore they occur 
in many tasks which do not involve a hierarchy of habits. Book (1925), in 
his handbook on learning to typewrite, located the lapses in attention and 
drop in motivation at the point where letter habits were making a transition 
into word habits. Thus the failure to improve generated discouragement, 
which furthered the failure to improve. Individuals may become so discour- 
aged at failure to improve that they lose motivation for further practice. 
Indeed, plateaus may well be a fact which keeps adult learners from ac- 
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quiring such skills as playing tennis or playing the piano. A well-motivated 
adult can learn the fundamentals of these skills quite easily, but there then 
follows a long period when devoted practice brings no fruits. At this point 
interest drops and the individual will cease taking lessons. 

Frequently, the limit of performance which learners finally reach is a 
plateau. The author doubts that his typing has improved much in the past 
fifteen years, but there is certainly room for improvement. A little practice 
would eliminate some bad habits and bring in some new levels of organiza- 
tion which would no doubt quickly raise his level of typing. Thus, while he 
is currently at a limit of performance in typing, practice would raise that 
limit. There are surely real physiological limits to our ability to perform 
specific skills; we cannot type faster than our fingers will move. Most of us, 
however, seldom push ourselves to this limit. 


CONDITIONS OF PRACTICE 

The learning of any particular skill is a function of the conditions under 
which it is practiced. The shape of the over-all learning curve may be al- 
tered by change in conditions, or the presence or absence of particular 
components of the skill may be altered. In this section we shall examine the 
influence of some of the more important conditions of practice which have 
been experimentally studied. We shall restrict ourselves to conditions that 
apply to many tasks, since we are interested in general principles of learning 
rather than the characteristics of certain tasks. To take an example: We 
might be interested in the influence of the location of individual letters on 
the typewriter keyboard, since this could tell us something about general 
problems of habit interference, but we should not be interested in the influ- 
ence, say, of depth of keystroke on learning to typewrite. The latter prob- 
lem may well be of interest to those concerned with the specific skill of 
typewriting, but it is of little general interest in the psychology of learning. 

The Distribution of Practice 

The condition of practice that has been explored experimentally more 
extensively than any other is that of the distribution of practice through 
time. For a wide variety of tasks, the question has been asked, Is it better 
to practice a task with as little interruption for rest as possible, or is rest 
beneficial to learning and performance? For most of the experimental tasks 
that have been studied, the results show that some rest is better than none 
at all, and for some tasks the advantage in increased skill is quite large. 
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The problem is complicated; there are many tasks for which distribution 
of practice has little effect, and there are a few for which rest periods actu- 
ally seem to decrease abUity. Because of the complexity of the problem and 
because of the theoretical interest in the distribution of practice, it will be 
necessary for us to examine this condition in considerable detail. Let us 

first look at the experimental findings and then at some of the theoretical 
questions. 

Experiments on the Distribution of Practice 

A simple experiment. A well-known experiment by Lorge (1930) illus- 
trates the basic phenomenon in the distribution of practice. Lorge studied 
the effects of continuous practice for 20 trials compared with the effects of 
practice in which a rest period of one minute or a rest period of one day 
intervened between each trial. The tasks he used were mirror drawing, 
mirror reading, and code substitution. He found that both cases of distrib- 
uted practice resulted in better learning than massed practice. Figure 39 
shows the results for the mirror-drawing task. We can see that the time per 
trial for the two groups with distributed practice dropped much more rap- 
idly than the curve for the massed-practice group; the difference is apparent 
almost from the very beginning of practice. Notice also that the difference 



Figure 39. The effect of distribution of practice upon mirror drowing. Notice that, apart from 
the very first trials, there is almost a constant difference between the different groups. (After Large, 

1930.) « K V y , 
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between the group that rested for one minute between each trial and the 

group that rested for one day is very small. 

This experiment illustrates the basic effect of distribution in a perceptual- 
motor task of moderate difficulty. As is typically the case for such tasks, 
the difference produced by some distributed practice compared with none 

is large and consistent. 

The effects of distribution w hen practice is unequal. In the experiment 
just cited the amount of practice for the various groups was held constant 
and the time over which practice occurred was varied- What happens, how- 
ever, when the time over which practice occurs is held constant and the 
amount of practice varied? In this case, a group of subjects under massed 
practice would receive more practice than one under distributed practice. 

Duncan (1951) studied the effect of distribution of unequal practice on 
the acquisition of the ability to keep a small stylus on a constantly moving 
target (rotary-pursuit test). He allowed some of his subjects to practice 
continuously; the remainder of his subjects were periodically interrupted for 
a rest period. The rest periods actually occupied two-thirds of the practice 
period, so that the group practicing continuously had three times as much 
practice. Despite this heavy balance in favor of the massed-practice group, 
distributed practice produced better performance. The distributed practice 
group maintained the superiority it had shown during the initial learning 
after a pause of 10 minutes. Therefore it is clear that the effects of distribu- 
tion are powerful and consistent. 

Length of work and rest periods. It is reasonable to suppose that the 
relative effectiveness of distributed practice would depend upon the abso- 
lute and relative lengths of the work and rest periods. There are a number 
of experiments that explore this problem and provide us with important 
clues about the underlying mechanism responsible for the distributed- 
practice effect. 

Kientzle (1946) studied the effect of varying the rest period while hold- 
ing the work period constant. The measure of performance, she used was 
the number of alphabet letters subjects could print upside down in a one 
minute trial. She varied the rest intervals between trials from zero to seven 
days. Her results can be seen in Figure 40. In general, her experiment 
showed that for this task small rest periods resulted in a great improvement 
over no rest between trials, but that beyond 45 seconds, not much advan- 
tage was gained by increasing the rest interval. 

In another experiment on the alphabet-printing task, Kimble (1949b) 
showed with trials of 30 seconds duration that the advantage of spacing 
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trials kept on increasing from zero to 30 seconds rest. Since Kimble did not 
study distributions greater than 30 seconds, we do not know whether there 
would have been any advantage gained from even longer rest periods. 

In the two experiments just discussed, the duration of the work period 
was kept constant. What is the interaction between the length of the work 

one experiment (Kimble and 
Bilodeau, 1949) the task was rapid manipulation of small pegs, and it was 

found that shortening the work period is much more important than length- 



ening the rest period. Thus, the optimal rest period for any given task prob- 
ably depends upon the duration of the work period during practice. From 
the available evidence, it is likely that short work periods and short rest 
periods are most beneficial to practice, but the actual duration of the work 
period wiU clearly depend upon the nature of the task. It will not do, for 


example, to break practice into unnaturally short units for the task in ques- 
tion, interrupting someone solving a puzzle before he has reached solution 
may well have disrupting effects upon practice. 

The interaction of distribution and nature of the task. A large literature, 
only a fraction of which is cited here, supports the contention that some dis- 
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tribution of practice is better than none for nearly all psychomotor skills. 
We do not know very much about how distribution of practice interacts 
with the characteristics of specihc tasks, however. For theoretical reasons 
(see below) it has commonly been supposed that distribution ought to be 
more effective for tasks involving a large amount of physical work and 
effort expenditure than for those involving little work and effort (see Ellis, 
1953), but there has been little systematic exploration of task-distribution 

interactions for psychomotor skills. 

Underwood and his associates, however, have studied rather exhaustively 
the relationships between distribution of practice and various conditions pre- 
vailing in rote verbal learning. The results are rather complicated. For ex- 
ample, one study (Underwood and Goad, 1951) showed that distributed 
practice was superior to massed practice for learning lists of adjectives 
by the method of serial anticipation if the similarity between adjectives was 
high. If the similarity between adjectives was low, massed practice was as 
effective as distributed practice. On the other hand, irrespective of the 
similarity of items, learning adjectives by the method of paired associates 
was unaffected by distribution (Underwood, 1951). Again, to make the 
picture more complicated, Oseas and Underwood (1952) find that dis- 
tributed practice benefits the learning and retention of concepts, when the 
method requires subjects to abstract concepts by being exposed to succes- 
sive examples of the concept. Furthermore, comparisons of the results 
achieved by Underwood and his associates with those achieved by another 
study (Hovland, 1949) suggest that the level of skill or sophistication of 
subjects in laboratory learning may interact with the distribution of prac- 
tice (Underwood, 1953b), 

The net result of all the intensive work on the effects of distribution of 
practice on a particular kind of learning has been to show that it is ex- 
tremely difficult to apply a simple theory to all of the known phenomena of 
the distribution of practice. This is important, because much of the interest 
of experimental psychologists in the distribution of practice is not so much 
in the effect itself as it is in the means the effect has provided for testing 
certain theoretical notions about learning and skilled behavior. We shall 
return to some of these problems when we consider theories of the effects 
of distributed practice. 

The permanence of the effects of different distributions. We have seen 
that for the vast majority of psychomotor skills, distributed practice is su- 
perior to massed practice. For many verbal tasks this is also true. It is to 
be noted that wherever the effects of distributed practice have been studied 


the psychology of learning 

.raining figh,„ pi,a„ 8„ „ claLriyTord to 

a fixed an.ou„, of .„„o„i,io„ was dis.Hbu.ed over niany frainto „i 

om .he percemage of targe, hi.s was as nruch as five ,i„es be.,er .haTIf 
all the ammunition was fired on one mission. 

An important question we have not yet answered concerns the perma 
nence o the effects of distributed practice. If we teach two different gZps 

with ^ ^ one group practicing with distribution and the othL 

with massed practice, the chances are that we shall establish a difference 

m performance at the end of practice. If we wait a period of time however 

before measuring retention, will the difference between these two groups’ 
disappear or be maintained? ^ ^ 

havZ^Z-^^ experiment in which the permanent effects of distribution 

soTeZat ren^^n^scence. Reminiscence is a 

woTd bu it ° context of the 

we shall *" experimental research on learning, and 

skil stp and T’ '^hen individuals practice at some 

menttis rr'f H ® ^^"’^'’-oence is that there is an actual improve- 
Zk placT) ' (as there would be if forgetting 

an Jtd n h"*'! reminiscence occurred in rote verbal learning 

such I? phenomenon in 

limitd t T’ I " *" ^"^bal learning it is 

limited to the leammg of material like nonsense syllables when practice is 

severely massed (Hovland and Kurtz, 1951 ). Under these conditLs there 

Z Z f ^ ^ increase in the ability to recall nonsense 

syllables after a little time has elapsed. 

If the phenomenon of reminiscence were limited to rote verbal learning 
of nonsense syllables it would not be of much significance, since it occurs 
n limited amounts and only under highly specialized conditions for even 
this kind of learning. It turns out, however, that considerable amounts of 
reminiscence occur in the learning of psychomotor skills. 

The importance of reminiscence to the problem of the distribution of 
practice is that it seems to be a function of the massing of practice. In the 
case of nonsense syllables, no reminiscence occurs with distributed practice 
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ear, with the passage of time some of the difference induced by massed 
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and distributed practice disappears— the massed-practice group performs 
more like the distributed-practice group. Two important questions arise. 
Does this difference in reminiscence between massed and distributed prac- 
tice occur in psychomotor skills? If so, does the difference between massed 
and distributed practice entirely disappear after a time interval? 

The answer to the first question appears to be clear enough. In psycho- 
motor skills, the amount of recovery during a rest period seems to depend 
upon the amount of massed practice that has gone before. The second 
question poses a more complicated problem. Under some conditions the 
difference between massed and distributed practice does entirely disappear. 
For example, in an experiment by Kimble ( 1950) on a psychomotor skill, 
subjects practiced at the task under either severe massed or distributed prac- 
tice. After a six-minute rest period, despite large initial differences in per- 
formance between the two groups, the group that practiced under massed 
conditions did as well as the one trained under distributed conditions. In 
other words, reminiscence from massed practice made recovery from the 

effects of massed practice very nearly complete. 

This case of complete or nearly complete recovery (some ambiguity 
exists because of warming up when returning to a task after rest) is by no 
means the rule. It may, indeed, be the exception. For much the same kind 
of task, Jahnke and Duncan (1956) discovered that a difference produced 
by massed or distributed practice remains after an interval as long as four 
weeks, although the difference between massed and distributed practice 
after so long an interval is smaller than it was originally. Reminiscence oc- 
curs, but not so that the difference between the massed and distributed 
groups is wiped out. 

In verbal learning, much the same sort of situation occurs. Sometimes 
reminiscence shown by a massed-practice group of subjects will wipe out 
performance differences between massed and distributed practice and some- 
times it will not. Even if reminiscence as usually defined does not occur, 
it is possible that the difference between massed and distributed practice 
could disappear in a later recall. This could happen if both the massed and 
distributed group forgot enough items to obtain equally poor recall scores. 
Again, on this issue, the results of different experiments are not in 
agreement. Retention of verbal material learned by rote methods (either 
serial-anticipation or paired-associate) may or may not be enhanced by 
distributed practice (see Underwood, 1953a). The reason for this lack of 
uniformity in experimental findings is important, and we shall return to it in 
a discussion of the theoretical issues in the distribution of practice. 
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Let us look at an example of the bilateral transfer of reminiscence (Irion 

usta son, 1952). In this experiment subjects practiced continuously 

for five minutes at following a moving target (pursuit rotor). After prac- 

ice, one group of subjects immediately received five minutes additional 

practice with the other hand, while subjects in a second group rested for five 

minutes before changing hands. It turned out that the group that rested 

was significantly superior with the other hand to the group that transferred 

immediately. Thus the detrimental effects of massed practice transferred 

rom one hand to the other; they were not limited in location to the muscles 

involved in the original practice. This effect has been repeatedly confirmed 

(see Rockway, 1953), and it must be seriously considered in the theoretical 
issues concerning the distribution effects. 

Theories of distributed practice. Current attempts to account for the 
comparative effects of massed and distributed practice for the most part 
have emphasized the role of inhibition from massed practice. Some people 
have argued that the advantage from distributed practice is mainly the re- 
sult of rehearsal during the rest intervals, but the evidence is against this 
notion on a number of counts. First of all, many experiments have been 
controlled for possible rehearsal, and these have still found superiority in dis- 
tributed practice. Secondly the effect is more stable and larger in magnitude 
with psychomotor skills than with verbal material, and, intuitively, it would 
seem that verbal material would be easier to rehearse away from the testing 
situation than psychomotor skills. Finally, the effect also occurs in animal 
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learning (Bunch and Magsdick, 1933; Holland, 1953), where rehearsal 
is improbable. 

We have already had occasion to examine an important theory about 
inhibition. In the chapter on extinction, we examined a theory of extinction 
that attempted to account for the experimental evidence on the basis of two 
inhibitory states. One inhibitory state, reactive inhibition, is the simple re- 
sult of responding. It builds up through continued responding and disap- 
pears through rest. A second inhibitory state, conditioned inhibition, de- 
pends upon the first; the organism learns not to respond (or, perhaps, not 
to respond well) as the result of being induced not to respond through 
reactive inhibition. This notion, originally advanced by Hull (1943), has 
been modified and extended by several investigators. It was originally put 
forward to account for some of the basic effects in experimental extinction. 
As we have seen, it was only partially successful in this context; something 
like the notion of reactive inhibition seems to have survived experimental 
test— there does seem to be some temporary inhibition to further respond- 
ing generated by behavior— but the concept of conditioned inhibition did 

not survive the experimental onslaught in the study of extinction. 

Since, as we have just seen, the decremental effects of massed practice 
seem to result in both a temporary inhibition (hence recovery as it is shown 
by reminiscence) and a relatively permanent decrement, several investi- 
gators have assumed reactive and conditioned inhibition to be operating; 
they have tried to find techniques to evaluate unequivocally the role of these 
inhibitions in the distribution of practice phenomenon (Ammons, 1947; 
Kimble, 1949a). The logic of the technique for assessing the role of these 
two inhibitory factors is straightforward. Two groups of subjects are com- 
pared, one group working under massed and the other under distributed 
practice. The difference in performance between the two groups at any 
point represents the sum of reactive and conditioned inhibition. At some 
point a rest period is introduced (to allow reacuve inhibition to disappear) ; 
the gain in performance after rest represents the amount of reactive inhibi- 
tion that has disappeared, and the difference, after the gain, between the 
massed and distributed group represents conditioned inhibition. 

Despite the simplicity of this analysis, it has many pitfalls. First of all, 
it assumes that all the reactive inhibition dissipates during rest. Secondly, it 
assumes that practice in the distributed group was so well separated that 
no inhibition occurred. Thirdly, it assumes that despite a difference in per- 
formance the two groups learned equally (it would not do to have a 
difference in habit strength as well as in inhibition between the groups). 
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Finally, it does not take direct account of the warm-up that must occur 
after a rest. All m all, this is a rather forbidding list of assumptions; so 
we see that the simplicity of the analysis is more apparent than real. 

ome investigators have tried to control or analyze the factors going 
into these assumptions (Wasserman, 1951), but the results have not al- 
ways been convincing. Therefore an exact test of the application of formal 
mh.b.tory constructs to the massed and distributed-practice phenomenon 
has not yet occurred. Furthermore, as Underwood pointed out in his previ- 
ously mentioned series of papers on distribution in verbal learning, a whole 

host of detailed experimental results cannot be accounted for by these 
notions. 

At the most general level, we can say that there appear to be at least two 
components to the distributed practice effect, a permanent and a temporary 
one. Furthermore, neither of these seems to be localized in peripheral struc- 
tures; they are probably central in origin. Beyond this we cannot specify. 

It IS likely, however, that some of the difference between massed and dis- 
tributed practice in recall, even when the recall is separated in time from 
original practice, reflects an underlying difference in what has been learned, 
n an earlier section we saw that for most complicated tasks, doing is im- 
portant to learning. Therefore if the decrement produced by massed prac- 
tice IS severe enough to prevent individuals from performing adequately, 

It IS likely that they will not learn adequately. Even so general a notion as 
t IS does not allow us to understand in detail some of the effects of distri- 
bution. For example, Underwood (1953a) points out that in rote verbal 
earning by serial anticipation, items anticipated frequently during learning 
are recalled better after massed practice, while items infrequently antici- 
pated during learning are recalled better after distributed practice. This 
suggests some kind of interaction between distribution and rate of for- 
getting. If this is so, then the principles governing the permanent effects of 
the distribution of practice for material that is easily forgotten will be 
different from those governing the same effects for material less easily for- 
gotten. Again, this interaction may well account for the greater consistency 
of findings on the effects of distributed practice on psychomotor skills, 
since these skills are about equally easy to remember. 

The earlier theories and to a certain extent the current theories of the 
distribution of practice are based upon inadequate empirical exploration. 

This has not been because there has been a lack of experimental investiga- 
tion but rather because the problems are complicated. The future course 
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of theoretical analysis of this problem will require a further systematic in- 
vestigation of the principal variables. 

Information and Effect in the Learning of Skills 

There is a famous demonstration by Thorndike (1932b) of the im- 
portant fact that the consequences, or aftereffects, of an act are important 
determiners of the future course of that act. If a blindfolded person is re- 
quested to draw, with one quick move of a pencil, a line 3 inches long, his 
accuracy will be poor. If he continues to “practice'’ without being able to 
see what he is doing, his performance will not materially increase in ac- 
curacy, though it may become less variable. However, if an experimenter 
measures the drawn line and says “right" if the line is within 14 inch of 3 
inches and “wrong" if the line deviates by a greater amount, the subject 
will rapidly show improvement in his ability to approximate the 3-inch 

i criterion. 

This demonstration has been the subject of an enormous amount of 
misinterpretation. It is meant to show the importance of the aftereffects of 
a response in determining its direction of change. By implication it is fre- 
quently taken to mean that the reward (“right ) satisfies the subject s mo- 
tivation to increase his accuracy and that this has something to do with the 
improvement. To argue whether or not this is so easily leads us into a 
morass of slippery concepts. It may well be true (depending upon how 
systematically defined our notions of satisfaction and motive are), but 
there is no critical reason why it is necessary to describe this demonstration 
in these terms. The line-drawing demonstration shows only the operation 
) of the empirical principle of reinforcement, which says that there is a class 

of stimuli to selectively strengthen every response. 

“Right” as a reinforcement is trans-situational (see Chapter 2); that is to 
say, it will strengthen responses of a wide variety of classes. Almost any 
sort of information that leads the human learner to make what the experi- 
menter or teacher defines as a correct response is a reinforcement, though 
some such informational reinforcements are clearly not trans-situational. 

Knowledge of results in a training program. A good example of the 
difficulties involved in getting the correct information to the learner is found 
in flexible-gunnery training. In firing guns it is sometimes difficult for the 
trainee to know if and when he hits the target. During the Second World 
* War various training devices were developed to give individuals information 
about their accuracy in firing. One of these, the Waller trainer, was a screen 
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which simulated the sky and against which images of planes were thrown- 
a simulated gun and sights enabled the trainees to fire at the images. The’ 
apparatus was rigged so that it would score a “hit” whenever the individual 
had the sights lined upon the image of the plane and was puUing the trigger, 
n addition, there was a tone— a “beep”— which was sounded whenever 
the individual made a hit. This could be thrown in or out of the circuit so 
that the effect of this piece of information could be studied. Figure 41 tells 
the story. It is obvious that when the tone was sounded for a hit, the sub- 
jects did much better. When the tone was sounded early in practice it re- 
sulted in a greater number of hits. Later, when the tone was withdrawn from 



’9 1- The effect of knowledge of results upon learning to hit a target. Whenever a hit was 

, 0 tone was sounded. One group received the tone at first and not later. The other group 

received no tone at first but then heard one on the later trials. (Hobbs, 1947.) 

this group and given to the second group, the second group immediately 
increased in number of hits, so that it now surpassed the first group. Unfor- 
tunately, the investigators did not include a third group that was never 
given the tone. 

The same group of studies (Hobbs, 1947) shows that where there was 
little opportunity for knowledge of results, as in the gun camera mounted 
in the tail of a bomber, relatively little improvement in score took place. 
This fact ruled out several otherwise excellent, realistic training devices. A 
second important point is made by the same studies: Sometimes the subject 
gets the wrong information. Knowledge of results of training with a sta- 
tionary gunsight and target might be misleading, because in actual prac- 
tice the gunner must fire at a moving target from a moving plane and must 
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ieam to “lead.” Thus knowledge of results on a training device could actu- 
ally lead to negative transfer on the job. While this may not happen often, 
it is something which needs to be watched very carefully in any training 

program. 

Delay oj information and learning. In examining the influence of rein- 
forcement upon animal behavior we saw that a delay in reinforcement may 
disrupt behavior and slow down the rate of learning. Is the same thing true 
of reward and information in adult human learning? At first thought we 
should probably say no, because the symbolic processes in man would 
enable him to bridge anything but an enormous delay between the occur- 
rence of a response and the rew’ard or information that comes as its conse- 
quence. Let us, however, examine some of the experimental evidence. 

In one experiment (Lorge and Thorndike, 1935) subjects learned to 
toss a ball at a target they could not see. Some subjects were told how well 
they did on a particular throw'. immediately after the throw, for other sub- 
jects the information was delayed for various short intervals of time. In 
general, as one might expect, there was little or no difference due to vary- 
ing intervals of delay in the learning of this task. If, however, the interval 
between throwing the ball and the information about that throw was filled 
with another throw, the gain in accuracy was impaired. This result suggests 
that it is more than simply the passage of lime that is important in the effects 
of delay upon information and reward (we concluded much the same thing 
in the case of delay of reinforcement on animal behavior). 

Saltzman (1951) has demonstrated that a brief delay in reward and 
information in rote verbal learning can have considerable effects upon the 
number of errors made by subjects during learning. A delay as short as six 
seconds increased the number of errors by 50 per cent. Saltzman points 
out that his subjects rehearsed the response between making the response 
and the delay of information, and it is quite possible that rehearsing in ad- 
vance of knowledge of the adequacy of the response made may lead to 
slower learning. A series of systematic experiments on the influence of such 
factors is needed. 

"Spread of effect” and information. It is quite clear that information 
about the adequacy of his response allows a learner to adjust his perform- 
ance more nearly to approximate a norm. Thorndike argued for many years, 
however, that this was not the only way to look at reward and information. 
He believed that rewards had automatic effects on responses that were over 
and above any deliberate use of information on the learner’s part; rewards 
“stamped in” preceding responses or stimulus-response connections. 
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vintrSoi;rzr„r r - 

Thorndike’s ^ ^ automatic, blind fashion. 

0 ^. f ^ experiments seemed to show that the conse 

q nces of a response (reward or punishment) were not specific to the 

sponses temporally adjacent to that response. ^ 

Thorndike used was rather unusual 
Subjects are given a series of items to which a number of resLses “re 

possible, as in a multiple-choice test. The subjects are instructedTo choose 
know whfch re*" does not 

responses a. .he beginning should be fairly randon,. In oneIxperiZm fo 

b^g” easing 'a ntm^ “l""^ 1“ '“P”* “ 

oy guessing a number from 1 to 10. The experimenter follows the subiect’s 
number with “right,” “wrong,” or nothing at all. The subject Relieves t^at 
he must learn to associate the right number with the right word The L- 
perimenter, however, has decided beforehand which of the words he wW 

mo is told that 

Ty in whicTh ““""r ‘ ‘here is no 

way m which he can know that there is nothing for him really to learn 

e effect Thorndike found was that the statement “correct” not only 

ncreased the probability of repeating the response that it followed, but it 

increased the probability of repeating the responses before and after the 

Reure ^ spread-of-effect gradient in 

Mgure 42. The responses on either side of the middle response had nothing 

following them; however, they did increase in per cent repetition over 
chance expectancy. 

At the time Thorndike’s results were published, they did indeed seem 
o give strong support to the notion that the consequences of a response 
operated by the principle of effect as well as by information. In the inter- 
vening years, however, a number of complications have been found in this 
kind of experiment. First of all, it appears that it is not a stimulus-response 
connection that is strengthened; it is just the habit of guessing one particular 
number after some other number (Zirkle, 1946). Even when the order of 

items given to the subject is changed on every trial, the spread of effect 
remains. ^ 

to show almost overwhelmingly 
that the spread of effect in this type of experiment is primarily due to the 

fact that human beings seldom respond randomly, and the effLt does not 
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seem to be related to the principle of effect at all. These experiments show 
that the guessing habits of subjects are the important determiners of the 
effect. When no opportunity for guessing habits is permitted, no spread of 
effect appears (Jenkins and Sheffield, 1946; Jenkins and Postman, 1948). 
On the other hand, Sheffield (1949) has shown that pure guessing without 
any information or reinforcement will result in an aftergradient like that 
found in the spread of effect. So it seems that the dependent probabilities 
in human verbal behavior provide the background for the spread of effect 
(Smith, 1949). Finally, Sheffield and Jenkins (1952) have shown that the 
more closely an experiment approximates Thorndike s original one without 



serial position, N-- neutral R =''RIGhT" 

Figure 42. The spread-of-effect phenomenon. The probability of items adjacent to a rewarded 
item in a series is increased, as lllustroted in this hypothetical example. 

the presence of reward, the closer one gets to a typical spread-of-effect 
gradient. Thus there seems to be little doubt that Thorndike’s original 
spread of effect was an artifact. 

Some investigators have pointed out the possibility of another, more 
general kind of gradient (Marx and Bunch, 1951; Marx and Bernstein, 
1955). This is not based upon temporal proximity to the reward, but upon 
resemblance of other items to the rewarded item. Such a gradient would 
appear to be reasonable under certain conditions, but it is not the same as 
spread of effect, and it adds nothing to the notion of the automatic strength- 
ening effect of reward. Rather a reward appears to function ( 1 ) to keep 
activity going (as it does for animal behavior) and (2) to provide informa- 
tion about the adequacy of past behavior and the direction future change of 
behavior should take. 
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Other Conditions of Practice 

Of course there are a large number of factors that determine the effleiency 

they are unsystematic arrangements of mote basic vatiabies. For example 
consider the problem of active participation in learning. ^ ’ 

riol participation in a task yields supe- 

rior performance, recall, or transfer for that task. For example, interruption 

nesroftheT "’“1‘iple-choice questions increases the effecdve- 

thoLh ne h " ‘""‘^hing device (Kurtz, Walter, and Brenner, 1950), 
gh perhaps, rather surprisingly, no more so than simply repeating the 

film a second time. Obviously, the questions provide more active participa- 

lon on the part of the viewers than does merely watching the film. 

en we ask why active participation is more effective than passive 
participation, we immediately see that it is a complicated question. Par- 
ticipation varies for different tasks, and it must have somewhat different 
effects. It may supply the learner with knowledge of results with short 

elay, increase the learner’s motivation, or help him eliminate errors early 
in practice. ^ 


What IS surprising, perhaps, is that retention of verbal material is better 

when time is taken away from reading or study of the material and put into 

active recitation. In one study (Gates, 1917) subjects who spent up to four- 

fths of their total study time in recitation rather than in reading showed 

t e greatest improvement. The advantage gained from recitation is fre- ^ 

quently large, and this is one of the points about habits of study that is 

emphasized in manuals on study habits (see Robinson, 1946). Very likely 

recitation is important to good study habits because it provides direction 

to the learner’s work and readily supplies information or knowledge of re- 
sults about performance. 

Another problem that sometimes arises in application of principles of 

learning to formal training or schooling is the question of whole and part 

leammg. This originally arose in the context of rote memorization, where 

perhaps it makes most sense, and it was concerned with whether it is more 

eflScient to memorize by reading the material from beginning to end each > 

time or by breaking the material into its constituent parts and studying each ^ 
separately. ® 

A little reflection will show that even in the limited context of rote 
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memorization, the question of whole and part learning is complicated. What, 
for example, is the role of length of task? Perhaps for short tasks, such as 
memorizing a sonnet, the whole method would be better, whUe for longer 
tasks, memorizing a book from Paradise Lost, for example, the part method 
might be more efficient. Unfortunately we do not have systematic informa- 
tion to answer the question about the relative efficiency of whole and part 
learning for all tasks. We cannot even make a reasonably sure guess, as we 
might in the case of massed versus distributed practice. We do know that 
some conditions favor whole learning and others favor part (Jensen and 
Lemoire, 1937), but we do not know enough about the interaction of this 
problem with other methods of practice to make clear recommendations 

about a wide variety of tasks. 

TASK VARIABLES 

Let us now consider how some of the important differences between 
various tasks affect learning. No task that a human being learns is quite 
like any other. Some of the differences between tasks are trivial and others 
are of limited general interest. There are, however, a number of ways in 
which tasks differ that are fundamental to the psychology of learning. 

First of all there is the important difference between those tasks that are 
primarily verbal and those that are not. Then there is a dimension of per- 
ceptual emphasis along which tasks vary; some are primarUy a matter of 
learning to make discriminations and to form new perceptual organizations. 
Other tasks are primarUy a matter of adjusting motor performance to meet 
some norm of skUl. Stiff others are matters of conceptual organization. In 
tasks that demand simple memory, there is an important dimension that 
runs from demanding memorization of a collection of unstructured elements 
to demanding memorization of highly organized and meaningful material. 
Compare, for example, an attempt to memorize a list of 100 nonsense sylla- 
bles with an attempt to memorize 100 successive words from a story in a 

popular magazine. 

Then there are the two important dimensions which run through prac- 
tically all types of tasks— difficulty and length, or amount, of material to 
be learned. Though we regard these as fundamental dimensions, they are 
not necessarUy pure. Task difficulty is frequently correlated with some other 
way in which tasks vary (meaningfulness, for example), and it is by no 
means a simple variable. As a matter of fact, such variables as task diffi- 
culty are probably only rough empirical classes, and the future development 


204 


THE PSYCHOLOGY OF LEARNING 

“Otions altogether. 

, owever, a considerable part of our understanding of the basic 

r”T P^y^^hology of learning is tied up with such variables as 
difficulty and amount, and we mncf r^^nrx^A ♦! «« 


Difficulty 

lea^ g^n^rally one that takes us a relatively long time to 

am. Yet, it is not possible to equate difficulty with the length of time or 

the number of trials it takes for a learner to reach some arbitrary rto 
ihties of the learner, his motivation, the amount of preceding practice 

time takes to reach some criterion of performance. We can, however 

budon oJ "" conditions, such as distri ’ 

task in r ‘^o^^tant and vary some aspect of the 

sk m such a way that the Ume taken to reach a criterion is changed. 

Difficulty and Response Characteristics 

Nurnber of available responses in trial-and-error learning. Trial-and- 
error learning generaUy refers to those situations in which learners cannot 

ff f knowledge, and in which they 

must t^ to find the correct solution by exploring the available alternatives. 

such learning an important determiner of difficulty is the number of 

choices the learner is given (Noble, 1956). The larger the number of al- 

ernatives (assuming the number of correct alternatives remains the same), 

rM difficulty of the task. For example, in one experiment 

(Noble, 1955) subjects had to leam to push four buttons in a particular 
sequence; m this situation the number of trials required to reach an arbi- 
trary criterion was enormously increased when the number of available 
uttons increased. If only four buttons were present, they were all relevant 
to the task, but when the buttons were increased to 10 in number, 6 of them 
were irrelevant and the subjects had to learn to eliminate these. 

Interaction between number of available responses and task length. 
e influence of number of alternatives on performance is modified by the 
ength of the task. This is demonstrated in a series of studies by Broeden 
and Schmidt (1954^, 1954b). These investigators studied the effects of 
increasing both the number of response alternatives available at each choice 
point and ffie total number of choice points. They used a verbal maze, one 
m which the learner must find his way by guessing which of a number of 
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available responses at each choice point will allow him to go on to the next 
choice point and eventually the end of the maze. In their experiments, Brog- 
den and Schmidt studied two different lengths of maze, one 16 units long 

and another 24 units long. 

The results of this study can be seen in Figure 43. Both the number 
of errors and the time taken to reach a criterion score increased linearly 
with an increase in the number of alternatives. It is interesting to note, how- 
ever, that for the two mazes of different length, the rate of increase in time 
and errors with increasing number of alternatives is different. Apparently, 
the longer the maze the greater the difficulty added by increasing the num- 
ber of alternatives at each choice. A careful examination of the curves in 
Figure 43 will reveal that the increase is disproportionate in the case of the 




NUMBER OF ALTERNATIVE CHOICES PER 

UNIT 


Figure 43. Curves showing the increase in errors and time to criterion with an increose in the 
number of choices at each choice point in verbal mazes. The dotted curves are for mazes 16 units 
long and the solid ones for mazes 24 units long. {Brogden and Schmidt, ]954fa.) 


longer maze. In other words, the greater time and greater number of errors 
to criterion is not simply accounted for by the fact that there are more 
choices to be made; increasing the number of choices and increasing the 
length of the maze seem to have different effects. 

In these studies one important feature turned up that is of critical interest. 
In general, increasing the number of alternatives at each choice did not in- 
crease the number of trials required to criterion. Thus, while the total number 
of errors made by subjects and the total time to criterion were increased by 
adding to the number of choices, the subjects did not increase the number 
of times they had to go through the maze in order to reach criterion. This 
is in contrast with the results found by Noble (1955), and the answer lies 
in the fact that Noble’s definition of trials differs from that of Brogden and 
Schmidt. In Noble’s study trials means the number of guesses the subject 
makes; Brogden and Schmidt define trials as the number of times the sub- 
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ject had to go through the diflferent choice points. Miller f 1956 a ^ 

r r 

, does make a difference as we shall shnrtiv 

erefore the total time, since errors take time), because the greater the num 
ber o( a„ern.uv=, d,e lower .he probabili.y of making a corre "guela If l^k 

rorr.£-"?T~“ 

Lc; poll t b' 

numS alternatives wiU increase the 

mber of errors made durmg learning, it will not increase the difficulty of 
remembering the correct response. ^ 

Difficulty and Characteristics of the Material Learned 
of f ^ to be learned and difficulty 

would if r automobile? Such a question 

an th general value, since these two tasks differ from one 

«,h ^ If, however, we compare tasks 

ich differ in only one characteristic we can learn something about how 
variation contributes to the difficulty of tasks. 

Association value. One important characteristic of verbal material is 

1 s richness of contextual association. Some words have a greater potential 

or e iciting associations than other words. Even nonsense words or syllables 

differ enormously in their potential for eliciting associations. Various lists 

o nonsense syllables have been carefully examined for their association 

alue, and it is possible to choose syllables of different value for different 
Kinds of experimental studies. 

Noble (1952a) has made use of an index of meaning applied to two- 
y a e nonsense and ordmary English words. This index was obtained by 
counppg me average aambe, of aasocLa.lop, given daring a 60-^oS 
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period to each word. The obtained index ranged from 0.99 for the word 
GOJEY to 9.61 for the word kitchen. It is interesting to note that such a 
nonsense word as ROMPin has a higher derived index of meaning than such 
infrequent but real English words as icon, matrix, and bodkin. Noble 
(1952b) used these words in a study of rote learning by serial anticipation. 

It turns out that a list constructed of words with an average meaning index 
of 1 28 took almost three times as many trials, on the average, to learn as 
a list constructed of words with an average meaning index of 7.85. These 
results, taken together with earlier studies of three-letter nonsense syllables 
(McGeoch, 1930), make it clear that associative potential of individual 
verbal items is an important determiner of ease of learning. It is wort 
noting that associative potential plays essentially the same role m paire - 

associate learning (Noble and McNeely, 1957). 

Similarity. A variable of considerable importance from a theoretical point 

of view is that of similarity among items in the material to be learned. 
Similarity, as we shall see in later chapters, is a rather complicated dimen- 
sion. Any task consists of stimulus and response components. Both may 
vary in similarity among the separate items and both may vary with respect 
to each other. In rote serial learning the same item serves as both stimulus 
and response in succession, while in paired-associate learning there is a 
clearer separation between stimulus and response elements. Therefore, we 
might expect some differences to exist between the effects of variation in 
stimulus similarity and in response similarity. Hence, tasks m which there 
are different relationships between the items must be considered separately. 

If we consider the rote serial learning of verbal items we see that an 
increase in the similarity between items generally results in an increase m 
the number of trials to criterion. For example, Underwood and Goad 
(1951) showed that lists composed of adjectives highly similar to one 
another, such as “elated,” “gleeful,” “carefree,” took more trials to learn 
than lists composed of unrelated adjectives like “worldly,” “fiery, 
“blotchy.” The same thing is apparently true of nonsense syllables, in which 
the similarity is achieved, not by meaning, but by duplication of letters 
(Lazarus, Deese, and HamUton, 1954; Underwood and Richardson, 1956). 

Underwood (1951) has also studied the influence of similarity upon 
paired-associate learning. In his study he compared learning of lists com- 
posed of distinctly different adjectives with learning of lists in which smilar 
adjectives were used as stimuli and responses. In general, the more simUar 
the adjectives the larger the number of trials to criterion. Unfortunately, 
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while holding stimulus similarity constant From similarity 

•ti-r =; 

learners should have a harder ^^hly similar, 

tinctive properties " """ remembering their dis- 

In general, it is probably safe to conclude that mWn a given task hieh 
of thlt fask wT^sho response components increases the difficulty 

the tasf ‘^---'ninate among all the components of 

a given st imir ^ to 

learning ThTc ‘^an retard 

and no -t t the essence of the difference between negative 

caser^r r opportunities to exam J cb 

cases in the next few chapters. 


Amount to Be Learned 

a s^blStoT^' "" ^t“tly to tell us that the more material 

amount to he^ 7 " ^"‘^“'ted, or indeed, that a greater 

of the reLt k ^he exact nature 

aiount S T """" y learning and 

interesting. It is also somewhat sur- 
pnsing, and 1 wUl repay us to examine it carefully. A classical monograph 

I ^ summarizes much of the best available evidence on these 

relationships, and we shall draw heavily on Lyon. 


Amount to Be Learned and Time per Item 

"’^‘iningful way to present the 
relationship between amount to be learned and time to learn is not to com- 
pare the total learning time with the number of items for different numbers 
of u.™. bu, .o se. how ,=„e.h .» ,e„„ 

Such a companson can be seen in Figure 44. This figure shows the re ’ 
suits of two of Lyon’s experiments on the time required to learn lists with 
different numbers of nonsense syllables. In one experiment Lyon learned 
ho„.e™e .,„ab,„ by p,.c,ic., .„h ^ o,h„ by" ^ 


( 
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the list only once each day. The curves show the time per syllable as a 
function of number of different syllables. As we can easily see, the time 
per syllable increases rapidly with continuous practice. We can imagine, 
from our earlier discussion, that this very rapid increase in time per syllable 
probably represents something more than the inherent difficulty of learn- 
ing longer lists; this may represent the cumulation of response-produced 
inhibition as well. So, for our present purposes the curve for one trial 
per day is probably more meaningful. It is rather surprising that while 
there is an increase in the time per syllable with longer lists, it is not a very 



Figure 44. The time per syllable required to learn lists of nonsense syllables of varying length 
for distributed and massed practice. The data for these curves were obtained on one highly troined 
subject. (Data from Lyon, 1917.) 

great increase and appears to level off. That is to say, with small numbers 
of syllables the addition of a few more syllables makes a large difference 
in time per syllable, but with a large number, the addition of the same num- 
ber of items makes little difference in time to learn. 

Therefore, if a large number of items are to be learned by the method of 
serial anticipation with distributed practice, the addition of an item or 
two costs little more than the additional time required to read the syllable. 
This result is somewhat surprising, because intuitively one might expect 
that if one has a great number of syllables to keep in mind already, the 
addition of one more would indeed add to learning time. Thurstone ( 1930b) 
has shown that it can be derived from his general learning equation that time 
per item should increase as the square root of the number of syllables beyond 
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lHownr?" ^ conclusion is in accord with the data 
own m Figure 44 for the once per day method as well as data from other 
earher mvestigators. ’ 

hiriTfn ‘hat it took 

emonze poetry. The results with poetry were much like those he 

found with nonsense syllables. While additional material did requ'e an 
me. items in this case were stanzas of poetry. Of course, the material 

thaTetiTnd'vid r'"* 

man eacn individual nonsense syllable. 

If the meaningful material were completely redundant, of course each 

Tt ZTumT' I said 

If however tlT I ‘ake. 

7-fi 5 Tr^’o T 3-2-1-4-3-2-5-4-3-6-5-4- 

(assuming th * ^ easily get the principle and additional items 

(assummg they followed the principle) would take no increment in time; 

^chTlist 'm r ; ^^"8‘h of the list. 

Such a list would be a redundant list; one could reconstruct the entire list 

sZe TZ 3-2-1-4-3-2. Now all meaningful material is to 

sZ IH V, , therefore the additional time per comparable item 

Un i. I meaningful material than for some nonredundant ma- 
tenal like random lists of nonsense syllables. 

Amount to Be Learned and Number of Repetitions 

Nonsense syllables. If the exposure time of each item in a verbal learn- 
g periment is carefully controlled, the number of repetitions would be 
y proportional to the time. In Lyon’s experiments, however, he did 
contro exposure time, so that the correlation between time of learning 
n nuna er of items to be learned is not perfect. Indeed, when we look at 
the results of Lyon’s observations on the amount to be learned and number 
of repetitions shown in Figure 45 we see that number of repetitions and 

nZif 7 differendy. The curve in Figure 45 shows the 

umber of repetitions, or trials, per syllable necessary for learning from 

the same study as the distributed practice example in Figure 44. The num- 
ber of repetitions required per syllable increases rapidly at first, but then, 

called'afte" o^fyte'ri^r immediately re- 
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rather surprisingly, it reaches a maximum and begins to decline. The result 
is that Lyon required fewer repetitions per syllable to learn a list of 200 
nonsense syllables than he required to learn a list of 16 nonsense syllables 
(a little multiplication will show, of course, that the total number of repeti- 
tions was greater in the case of the 200 syllable list). These results are not 
peculiar to Lyon as a learner; in his monograph he summarizes results of 



Figure 45. Number of repetitions per syllable required to learn lists of nonsense syllables of vary- 
ing lengths by the once per day method. This curve is based upon the some study illustrated in 

Figure 44. (Ooto from tyon, I9?7.) 

experiments by Ebbinghaus and Meumann which show essentially the same 
thing, though not in so dramatic a form as Figure 45, 

Thus it seems probable that the number of repetitions does not increase 
with the number of syllables beyond some relatively small number, and 
that it may actually decrease, especially if the exposure time is uncontrolled 
as it usually was in these early experiments. What if the exposure period is 
carefully controlled? Unfortunately, we do not know what happens, since 
there are no data over a range of material comparable to that investigated 
by Lyon in which exposure time per item is controlled. These experiments 
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the ^rlhaTLvon did T' “vestigator has tackled them on 

would not decrea^se if this w Possible that repetitions per syUable 

4 <r » u • ^ comparison of Figures 44 and 

increase much with the number of syllables. The net result of these studies 
. to ™k. ..probable .ha. ,he inc.eas, ip an,o„„. of work (.i„e or ) 

lonZ 1,^^: span .o 

.wl'TZmrfTf ■ "'““o-ship be. 

vlrbal ZZl 1 T meaningful 

the e Ir It' " "" r' ^onfrol 

hke Lvon^ consequently we should expect his results to be much 
ke Lyon s. And in fact they are, when the differences produced by Gofer’s 

- '-to- 
rn etZld^HH "7^^ -‘-on as more 

wit teer t! Vh unit word actually decreases 

td at oef un H --ber of 

t, emphasize verbatim (rote) learning or logical 

r, ^ ^ ^^P^^^ncnts on nonsense syllables, 

because”''? th^t ®“bjccts were relatively unpracticed. This is, of course, 
ecause of the higher redundancy in meaningful material. 

The results of the avaUable experimental evidence clearly suggest that 

If '’^^tcrial does not necessarily disproportionately increase diffi- 

cu y o earning as has commonly been supposed both by learning theorists 

r ■ casually interested in learning. This conclusion, however, is 

imite y the word necessarily” because many of the important experi- 
mental parameters have not been thoroughly explored. It is possible, for 
examp e, that material with high similarity between items would result not 
on y in an over-all greater number of repetitions per syllable, but repetitions 

7 ^ 1 g^^^^tcr for longer lists composed of such ma- 
terial. This whole question of the relationship between trials to criterion 

n”‘ 't exploration, particularly since there is a 

readi V mT ?f ‘'^^-^ves 

readily to mterpretaUon m terms of measures of information. 
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TRANSFER OF TRAINING 


There is no more important topic in the whole of the psychology of learning 
than transfer of training. Nearly everyone knows that transfer of training 
is basic to educational theory. Practically all educational and training pro- 
grams are built upon the fundamental premise that human beings have the 
ability to transfer what they have learned from one situation to another. 
This assumption is evident in things as different as the curriculum of a col- 
lege of classical tradition and an adolescent who is persuaded by his mother 
to attend dancing school in the fond hope that training in the graces of ball- 
room dancing will permit him to navigate the length of the house without 
danger to less sturdy furniture. 

The problem of transfer of training arose in the context of educational 
theory, but it has always been fundamental to psychological theory as well. 
Indeed, the basic psychological problems in the transfer of training pervade 
the whole of the psychology of human learning, and in one sense the major- 
ity of the topics this book examines are problems in transfer. Thus, the 
stimulus generalization of a conditioned response, the chaining of responses 
in maze learning, and the retention of verbal learning all involve special 
cases of transfer of training. 

In this chapter we shall examine some major techniques for the study of 
transfer of training as a problem in itself, some conditions that lead to posi- 
tive transfer between tasks, and some theoretical principles that arise out 
of the study of transfer. 

THE MEASUREMENT OF TRANSFER 

Transfer of training means that training or performance on one task has 
influenced performance on another task. Thus the fundamental technique 
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m the study of transfer of training is to compare the performance on a par- 
ticular task of two groups of subjects, one of which has practiced on another 
task. There are a number of ways in which such an experiment can be per- 
formed and the comparison evaluated. These will be discussed in the foUow- 
ing section. 

Experimental Designs in the Study of Transfer 

The proactive design. The proactive design is the simplest arrangement 

in the study of transfer. Basically, there are two groups of subjects, one of 

which practices a preliminary task and both of which then are tested on a 

subsequent task. Thus the basic conditions can be summarized as follows: 
Experimental group 

Learns task A t ♦ i o 

^ , Test on task B 

Control group 

Rests ^ 

if task A has had an effect upon task B, these two groups should perform 

erently on the test of task B. If the experimental group performs signifi- 

cantly better than the control group on task B, we can say that proactive 

facilitation, or more generally positive transfer, has taken place. If, on the 

other hand, the experimental group performs significantly worse on task B, 

we can say that proactive inhibition, or negative transfer, has taken place. 

there is no difference between the two groups on task B, of course, no 
transfer has taken place. 

It takes at least two groups of subjects to perform a transfer experiment 

of this type. For many purposes, however, we may wish to increase the 

number of comparisons. We may have, for example, several variations in 

task A. In such a case we should want to compare each of the variations 

in task A with a control or rest condition. Furthermore, we may want to 

compare the amount of transfer between task A and B with the eflfect of 

direct preliminary practice on task B. Thus a more complicated proactive 
experimental design might be as follows: 

Group 1 : Learns task B g 

Group 2: Learns task Aj Test on task B 

Group 3 : Learns task Aa jest on task B 

Group C: Rests on task B 

t IS surprising to find out that there are circumstances when transfer 

rom one task to another is greater than the advantages of direct practice 

on the task to be tested. We shall examine some of the circumstances sur- 
rounding this unusual case later on. 


TRANSFER OF TRAINING 


215 


One further variation in the proactive design provides for a retention test 
for the second task. In the simplest example of this design, one group learns 
task A, then both practice on task B, and finally both are tested for reten- 
tion of task B. The design is as follows: 

Experimental group 

Learns task A . . . Learns task B . . Tested for retention on task B 
Control group 

Rests Learns task B . Tested for retention on task B 

This particular design is not so important to the theory of transfer of 
training itself. It is, however, fundamental in the application of the theory of 
transfer to an understanding of the forgetting process, and in the next 
chapter we shall have ample opportunity to see examples of the use of this 
design. 

The retroactive design. The other commonly employed experimental 
plan for the study of transfer is known as the retroaction experiment. In 
this design, an experimental group practices first on task A, then on task B, 
and finally on task A again. The control group practices only on task A 
for a time comparable to the experimental group. Thus the experimental 
design is as follows: 

Experimental group 

Learns task A Learns task B Test on task A 

Control group 

Learns task A Rests Test on task A 

The critical comparison is on the final test of task A. For readily apparent 
reasons, this design is sometimes called the “fore-and-after” method. It has 
the advantage that the initial practice on task A can serve to equate the two 
groups of subjects in performance and thus reduce extraneous variability 
on the final test. Furthermore the design enables one to examine directly 
the influence of learning task B upon the retention of task A, and this is 
sometimes very important. 

The control group in proaction and retroaction experiments. In the 
ideal experiment the control subjects would be held in absolute inactivity, 
so that the resulting differences between the experimental and control con- 
ditions could be entirely attributed to the task assigned the experimental 
group. In practice, of course, it is impossible to keep human or animal sub- 
jects completely inactive. This seldom is a serious matter with animal sub- 
jects, however, since we have good grounds for supposing that they are 
not engaging in activity relevant to the tasks they are to learn when they 
are away from the testing situation. Thus, it is unlikely that animals will 
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rehearse task A in a retroaction design. It is possible, however, that human 
subjects will rehearse or otherwise engage in activity appropriate to the 
tasks they are required to learn in the experiment. Consequently, in most 
transfer experiments with human subjects, the control group is not kept 
completely inactive. Rather it is usually given some task to perform in the 

rest” condition that keeps it occupied; this task is one that we know from 
previous work will have little or no effect upon the task in which we are 
tesUng for transfer. This is particularly important in studies of retention, for 
which rehearsal outside of the experimental learning situation would have 
serious consequences for the outcome of the experiment. 

The Evaluation of Transfer 

It is obvious that one evaluates the amount of transfer by comparing the 
performance of a control and experimental group of subjects. There are 
many different ways in which this comparison can be made, however, and 
sometimes they lead to quite different conclusions. Therefore the question 
of the evaluation of measures of transfer deserves some attention. 

The ideal transfer measure would be one that could be expressed as a 
percentage in such a way that 0 per cent means no transfer and 100 per cent 
means perfect transfer, or the same performance level that would have been 
achieved if an equivalent amount of practice had been devoted to the task 
on which testing occurs. Gagne, Foster, and Crowley (1948) have given a 
formula which meets these requirements. It is 

^oup )^- ( score, control group ) 

(Total possible score) — (score, control group) ^ 

This formula is particularly useful in the proaction design in which there is 
no fore measure on the task receiving the transfer to take into account. 

By this formula, if the score of the transfer group is the same as the con- 
trol group (which had not received practice on another task) there is zero 
transfer. If, however, the transfer group achieves the maximum possible 
score and if the control group does not (as would certainly be the case), 
transfer is 100 percent. 

This formula cannot be applied in every situation, however. In many ex- 
amples of learning we do not really know what the total possible score is, or 
it is so unrealistic that we may end up with a distorted picture of the 
amount of transfer obtained. Furthermore, in the retroaction design we 
may want to take account of the effects of preliminary practice on the task 
receiving transfer, and this formula ignores such effects. 

Expressing amount of transfer in such a formula is convenient, and it is 
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frequently done. It may however obscure the way in which transfer from 
one task to another occurs. It is quite possible, for example, that transfer 
may affect initial level of performance on a task but not rate of learning on 
that task (see Woodworth and Schlosberg, 1954). It is even possible that 
transfer could affect the upper limit of performance without affecting rate 
of learning, though this is unlikely. The point is, however, that expressing 
transfer in terms of some simple formula like the one above would obscure 
such differential effects of transfer. The particular parameter of the learning 
function that transfer affects may be very important in the practical evalua- 
tion of transfer in a training program (Lawrence, 1954), and we shall have 
more to say about this matter later. 

THE THEORY OF TRANSFER 

The Educational Backgrounds of the Problem of Transfer 

Educational theories which concern transfer of training are older than 
the experimental study of the learning process. One does not need to look 
far to see why transfer of training is something that must be considered in 
any theory of education. In the first place, as we have already mentioned, 
education assumes transfer, for aside from its intrinsic value, there is no 
point to education apart from transfer. Furthermore, when the empirical 
study of the learning of children began, it was apparent that there were 
many practical questions about the transfer of training that demanded 
solution. Consequently, early educational psychologists were much inter- 
ested in transfer; indeed, one could say that transfer of training was the 
central problem of early educational psychology. Since many of the theories 
and controversies that arose about transfer of training among the early 
educational psychologists are still with us (though usually in more artful 
and sophisticated guises), an examination of these will set the stage for the 
analytic study of transfer. 

The notion of formal discipline. The idea of formal discipline, for a 

long time, was the bete noire of educational theorists. This was the idea, 

informally held by many people, that mental exercise is good for the mind. 

The classical preparatory curriculum in education embodies this notion. 

In former days the hapless student studied Greek, rhetoric, geometry, and 

Latin, not so much for their intrinsic value but because they exercised and 

sharpened the “mind.” Learning to conjugate in Latin would make a keen 

wit as a lawyer, or training in logic would enable the physician to exercise 
the healing arts more effectively. 
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Despite the apparent embodiment of the notion of formal discipline in 

older educational programs, it never really had the support of thoughtful 

men (see Stroud, 1940). The evils of formal discipline, however, became a 

sort of straw man in educational theory, simply because 40 or 50 years ago 

the educators needed a good lever to pry the traditional curriculum loose 
from the schools. 

Early studies of transfer. Early experimental studies of transfer of 
training were in large part directed towards demonstrating the inadequacy 
of the formal discipline theory of transfer — the notion that there would be 
transfer between two tasks to the extent that the first provided good hard 
mental exercise. Consequently there was much early emphasis upon the 
small amounts of transfer that actually took place between the formal aca- 
demic disciplines. For example, Thorndike (1923) showed that there was 
only a slight advantage in the reading of English for students who had pre- 
viously studied Latin. Similar studies (Thorndike and Ruger, 1923; Wilcox, 
1917; Cole, 1924) showed the limited transfer of Latin to such varied as- 
pects of English as reading, grammar, and spelling, as well as to the acquisi- 
tion of modern foreign languages. 

Relatively early, Thorndike and Woodworth (1901) performed a series 
of experiments designed to test theoretically some of the implications of the 
notion of formal discipline. In addition to experiments on classroom learn- 
ing, Thorndike and Woodworth performed studies of transfer in laboratory 
tasks. For example, they practiced their subjects in the estimation of the 
areas of various geometrical figures such as triangles and rectangles. They 
examined the influence of transfer from learning to estimate the area of cer- 
tain specific forms to the ability to estimate the area of other forms. The 
results of this experiment led Thorndike to an “identical-elements” theory 
of transfer . This notion and some that grew up in opposition to it are impor- 
tant, for they are the direct ancestors of our current interpretations of the 
mechanisms in transfer. 

Identical elements versus general principles in transfer. The identical- 
elements theory of transfer says that the specific elements identical to two 
tasks are transferred from one task to the other. Thus, training in addition 
benefits performance in multiplication for the simple reason that many of 
the responses required in addition are the same as those required in multi- 
plication. From the notion of identical elements, then, one would expect the 
amount of transfer to be determined by the number and importance (rela- 
tive frequency of occurrence) of responses common to two tasks. 

This concept is a vast improvement over the omnipresent transfer pre- 
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dieted by such a notion as formal discipline, for the theory of identical ele- 
ments clearly predicts that transfer will differ in amount between two tasks 
according to the variations in the tasks themselves. 

Educational theorists, however, have not always liked the notion of 
identical elements because of its suggestion of an atomistic structure in 
what individuals learn. People leam, they say, not just isolated responses 
which may work in some new situations and not in others; they learn gen- 
eral principles and rules for applying these principles in a rational way. 
Thus, to use the same example we used to illustrate identical elements, it 
is not merely that a child can apply what he has learned about sums to 
multiplication of /z-digit numbers; it is that some general principles about 
the properties of decimal numbers appl\' to multiplication. It is as if a 
bright child were taught the basic principles of addition and subtraction., 
and from these, himself discovered multiplication and division. 

For a while edudational psychologists and learning theorists acted as if 
there were two mutually exclusive notions about transfer — one right and 
one wrong. Common experience, however, suggests that transfer can occur 
by both identical elements common to two tasks and general principles that 
govern two or more tasks. The basic psychological problem is not to decide 
which of these two possible modes of transfer is correct or more valuable, 
but to attempt to uncover the basic rules governing all kinds of transfer. 
This has been the goal of most learning theorists interested in transfer, and, 
in part at least, they have been successful. The most elementary step, of 
course, is a stimulus-response analysis of the transfer problem, and that is 
what we shall examine next. 


The Stimulus-Response Analysis of Transfer 

A stimulus-response analysis of transfer is important because the most 
elementary way two tasks can differ from one another is to differ in some 
simple stimulus or response component. Thus if we train an organism to 
make some response to a particular stimulus and then test for this response 
with a slightly different stimulus, w'e have an opportunity for the evaluation 
of transfer. We can see that this is the kind of situation in which we measure 
stimulus generalization; indeed, this generalization turns out to be a special 
case of transfer of training, perhaps the most elementary. Let us examine 
it as a special case of transfer. 

Stimulus variation with response held constant. The essential charac- 
teristic of the measurement of stimulus generalization is that we hold some 
response constant and vary the stimulus that has either been conditioned to 
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the response or has been consistently paired with the reinforcement of that 
response. Remember that the major results of the study of stimulus general- 
ization showed that the more similar the test stimulus was to the original 
training stimulus, the greater the potential for eliciting the original response. 
From these studies we should guess that the amount of positive transfer 
would gradually decline as the stimulus situation was changed from what 
It had been during original training (assuming the responses required re- 
main the same). Such a relationship is illustrated in Figure 46. On one axis 
in this figure is the amount of (positive) transfer from 100 per cent to zero 
and on the other axis is stimulus similarity. Notice that 100 per cent transfer 



Figure 46. Per cent transfer of training between two tasks when the responses are identical and 
the stimuli are varied from being identical to being completely dissimilar. 


is defined as that case in which the test stimulus is exactly the same as the 
one used in original training. 

We may ask, Does the kind of relationship shown in Figure 46 hold for 
more complicated examples than conditioning? That is a difficult question 
to answer, since procedures used in the study of different learned acts are 
likely to be very different. Let us, however, examine some relatively simple 
examples of verbal learning. 

In one well-known experiment (Yum, 1931) subjects learned to asso- 
ciate nonsense syllables with stimulus items. Each syllable was paired with 
a different item, and after the subjects had learned all the associations, they 
could recall the appropriate syllable when they were given the proper stim- 
ulus item. In some conditions the stimuli were other nonsense syllables and 
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in others they were abstract geometric forms. Let us look at one condition 
in which the stimuli were the latter. The subjects first learned to pair each 
form with a different nonsense syllable (there were 12 in all). Then, 24 
hours later they were presented with the stimuli once more and asked to 
recall the right syllables. In some conditions the same stimuli as those used 
in original training were presented. In other conditions these stimuli were 
altered so that they varied all the way from resembling the original stimuli 
very closely to resembling them only vaguely. The results of the recall test 
were as follows: 



Per cent 

Per cent of recall to 

Stimuli 

recalled 

identical stimuli 

Identical 

84.6 

100 

Highly similar . . . 

64.5 

76 

Similar 

49.2 

58 

Less similar . . . - 

45.3 

53 

Least similar . . 

36.3 

43 


We can see that as the similarity of the stimulus items to those used in 
original training decreased, the percent of items recalled decreased. Thus, 
there was less positive transfer with dissimilar stimulus items than with sim- 
ilar ones. 

Results much like these w'ere also obtained by Gibson (1941), but her 
experiments went one step further. This is important because one could 
argue that the subjects in the experiment just described did not respond to 
the nonidentical forms as frequently as they did to the identical ones simply 
because they did not know that they were supposed to. Gibson, however, 
studied relearning of the same responses when the stimuli were identical or 
somewhat varied. The results were much the same; there were greater sav- 
ings in relearning when stimuli were identical or highly similar than when 
they were only slightly similar or dissimilar. 

Changing responses and holding stimidi constant. What happens when 
the stimuli for two different tasks are the same but when the responses are 
different? There is an old rule of thumb that says this situation always pro- 
duces negative transfer. This implies that the second task is harder to learn 
than it would have been if there had been no first task. The further impli- 
cation is that the responses learned in task one interfere with the acquisition 
of new responses to the same stimuli. We shall deal with this implication at 
length in the next chapter. 

For now, we should note that this rule is a considerable simplification 
of what really happens in practice. Actually, it sometimes happens that 
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this response much faster than they did the original right-leg response 
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In this situation the transfer is evidently a matter of components of the 
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e other feet as well. The degree to which the conditioning of the right rear 
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w 11 respond dunng ongmal conditioning with one or more of the other feet 

(It should be mentioned that in these experiments the dogs were supported 

such a way that they did not need to use their feet for support) 

t 1 changing the response leads to negative 

ransfer This would happen if a subject learns the association Sj-Rj and 

then finds it more difficult to learn the association Si-R^ than if he had 
not made the original association. In one of the best-known experiments on 
IS problem (Bruce, 1933) the associations consisted of pairs of nonsense 
syllables. Thus the subjects would be required to give a response like kiv 
when given the stimulus req. For the second list the subject might be re- 
quired to give a response like zam to the same stimulus req. Under these 
conditions negative transfer seems to occur. Learning a new response to 
an old stimulus produces interference. 

Not all experiments, however, have found this to be so. For example, a 
more recent experiment (Porter and Duncan, 1953) which used the same 
design, except that two-syllable adjectives were used instead of nonsense 
syllables, showed no negative transfer when a new list of response items 
was learned to old stimulus items. The lack of negative transfer was partly 
explained by the fact that since the subjects in this experiment were not 
highly practiced at rote verbal learning, simply learning some of the tech- 
niques of this kind of memorization obscured some of the negative trans- 
fer. There is, however, a more fundamental reason why negative transfer 
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did not occur, for in this experiment negative transfer was found with 
slightly different conditions. If the second task consists not of an entirely 
new list of response terms learned to the same stimuli but of the same re- 
sponse terms shuffled so that they are paired with different stimulus items, 
negative transfer occurs. Thus under this condition negative transfer was not 
swamped by general-practice effect. The reason, of course, was that the 
interference was much greater because the responses in the second task 
were already connected to other stimuli in the list making up the first task. 

Thus, in general, it appears that it all other things are equal (this usually 
means that there is no practice effect getting in the way), responses which 
are mutually exclusive will interfere with one another when we try to at- 
tach them to the same stimuli. In the experiment we mentioned a few para- 
graphs ago on the conditioning of different legs in dogs, the responses were 
not mutually exclusive or antagonistic, and they did not interfere with one 
another; indeed, since they were all part of common components of reaction 
to electric shock they tended to facilitate one another. 

This difference of positive transfer between responses learned to the 
same stimuli when they are components of the same act and negative trans- 
fer when the responses are mutually exclusive is sometimes expressed by 
saying that the kind and amount of transfer depends upon the amount of 
response similarity. Response similarity means that wfflen a response is asso- 
ciated to a given stimulus there are other responses which will tend to be 
elicited by that stimulus. It is assumed that there exists some- gradient of 
response similarity such that, to a given stimulus, some responses have a 
greater probability of being elicited than others. We can see that the notion 
of response similarity is roughly analogous to stimulus similarity. 

The transfer surface. We can put all of the stimulus-response relation- 
ships in transfer together in one diagram, but the diagram will require 
three dimensions, one for stimulus similarity, one for response similarity, 
and one for amount and direction of transfer. Such a diagram, derived from 
one originally drawn by Osgood (1949), is shown in Figure 47. The part 
of the figure in the lower right-hand corner is where maximum negative 
transfer is found. This is the case in which the stimuli between two tasks 
are the same but the responses required are different. On the other hand, 
maximum positive transfer is found in the upper left-hand corner where 
stimuli and responses are identical — in other words when the second task 
is merely further practice on the first. From this diagram we can see that 
further practice on the same task is only a special and limiting case of positive 
transfer, and perhaps logically it should be. On the opposite side of the dia- 
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transfer to performance at the typewriter. 

shim all of the possible stimulus-response relation- 

ships between two tasks. It ought not, however, be taken too literally. For 



one thing— as we shall shortly see— there are apparent exceptions to the 

j ^ diagram. For another thing we do not have general 
quantified scales for stimulus and response similarity. Therefore the exact 
shape of the diagram has no meaning; only the general relationships are 
meaningful. If any of the dimensions were expanded or contracted at the 
expense of any others, it would not do violence to the basic notion presented 
in the diagram. In short, it is a qualitative picture of the stimulus-response 
relationships in transfer, and with the exceptions noted below, it works 
lairly well in summarizing data on transfer. 

Transfer situations not summarized by the transfer surface. There are 
many kinds of transfer situations that are not easily summarized by the 
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transfer surface. Among these is learning how to learn, the general improve- 
ment that comes in learning many different tasks all of which demand a 
common technique of learning. In theory, it can be argued, this kind of 
transfer can be considered as a special case of the transfer surface, by 
thinking of the technique of learning (as it occurs, say, in rote verbal mem- 
orization) as consisting of a series of highly similar responses involving 
mode of attack, etc., which run through all the tasks to be learned. We are 
not always sure what such responses would be, however, and considering 
the general problem of learning how to learn as a special case of response 
similarity leads more to the aesthetic satisfaction of having one simple prin- 
ciple of transfer than to a real understanding of the mechanisms in learning 

how to learn. 

More seriously, there are real exceptions to the rules implied in the 
transfer surface. For example. Young and Underwood (1954) note that 
positive transfer does not always occur when stimuli are different and re- 
sponses are similar to one another. They showed that there will be little 
relationship between response similarity and amount of positive transfer if 
the responses within the two tasks are not difficult to differentiate. In other 
words, similarity of response between two tasks will not be an important 
variable in transfer unless the responses within the tasks are difficult to 
differentiate one from another. Thus the extent to which the transfer sur- 
face applies depends upon the stimulus-response relationships within tasks. 

An even more serious difficulty with an uncritical use of the transfer 
surface is indicated in an experiment by Deese and Hardman (1954). In 
this experiment subjects learned to complete sentences by the method of 
paired associates. Thus a statement like “the path is” would be given as a 
stimulus item, and the subjects would learn to complete the statement by 
giving the correct word “rocky.” A series of such statements were given as 
the original task. In the second task, the same stimulus items were given 
but the subjects had to learn new response terms. In one case the new re- 
sponse terms were synonyms (thus in the case of the path is, the new re- 
sponse would be “stony”); in the other case the new response terms were 
unrelated to the old ones (thus to the example given, the new response 
would be “short”). Looking at the transfer surface in Figure 47 we should 
probably expect the synonyms to yield positive transfer and the unrelated 
words to yield negative transfer. And that is exactly what happened in the 
learning of the second task. So far so good. This experiment went one step 
further however; the subjects were asked to relearn the original list. In this 
relearning, there was negative transfer in both cases and the amount of 
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negative transfer was greater after learning the synonyms than after learning 
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Identical elements and common principles in the transfer surface. The 

transfer' Th t descendant of the theory of identical elements in 

er J experimental studies of transfer have 

emphasized t^he analytic aspect of the problem, and the relationships be- 

ween tasks have been dissected into their stimulus and response compo- 
ents. The similarity relationships that have been studied can easily be con- 
sidered in terms of common elements. Therefore the transfer surface can be 

taken as a kind of detailed account of the transfer one would expect with 
Identical elements in two tasks. 

What about transfer by general principles? It is not difficult to stretch 

ame stimulus and response analysis to this case. One well-known study 

o transfer in a schoolroom task illustrates this. Knight (1924) had two 

groups of children practice the addition of fractions. One group practiced 

on fractions with denominators ranging from 2 through 30. Another group 

practiced with a limited number of denominators, 2, 4, 6, 8, 12, and 24. 

A test was given on those denominators practiced by the first group but not 

by the second. Both groups did just about equally well on the test. In other 

words transfer was perfect. It is easy to see that this example, which fits 

t e escription of transfer by general principle, also is in accord with the 

transfer surface. In this case some component, a general rule, was common 
to the two tasks. 

Wherever possible we shall try to refer examples of transfer of training, 
oth in this chapter and succeeding chapters, to the transfer surface. While 
some experimental situations do not lend themselves easily to a simple 
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stimulus-response analysis, the general usefulness of the transfer surface 
is considerable. 


SPECIAL PROBLEMS IN TRANSFER 

Learning How to Learn 

One of the most general and important kinds of positive transfer is 
learning how to learn. Every investigator of verbal learning knows that sub- 
jects who have never learned by rote association do relatively poorly in 
laboratory studies of paired-associate or serial-anticipation learning. Such 
subjects will take a large number of trials to learn their first list of, say, 
nonsense syllables. They generally show great improvement on the next 
list, however, and on each succeeding one they show additional improve- 
ment. After 15 or 20 lists, however, the amount of improvement from list 
to list is relatively small; the subjects have learned just about all they can 
about the techniques of learning nonsense syllables by rote. 

Learning how to learn is a very general phenomenon. So far as we know, 
it occurs in some degree for all techniques of learning and for all kinds of 
tasks. One of the central problems in the transfer of training is to find out 
how and why such improvement occurs, as well as to see what the implica- 
tions are for more general problems in the psychology of learning. 

Learning sets. In the chapter on discrimination learning we examined 
one such attempt to understand the mechanisms in learning how to learn. 
This particular problem is the formation of learning sets and is specifically 
concerned with the extent to which improvement occurs in successive dis- 
crimination problems. 

We have seen in our discussion of discrimination learning that as organ- 
isms are given successive problems, their ability to solve each problem im- 
proves. Thus it may take a monkey many trials to learn that a raisin is to 
be found under a red box instead of a blue cylinder if this is the first dis- 
crimination test it has ever worked at. After a number of such tests, how- 
ever, the monkey needs only one trial to show perfect discrimination. 

An analysis of the learning-set problem showed that it was the result of 
reduction of stimulus generalization (and hence negative transfer, since 
each problem requires a different stimulus-response connection) and the 
gradual emergence of a new habit, a habit of testing stimulus objects irre- 
spective of every dimension except position (left or right) for their correla- 
tion with reinforcement. The implication is that through the reduction of 
stimulus generalization and negative transfer a new habit is allowed to 
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emerge which transcends the particular problem the animal is working on. 
The animal learns to look for the significant variation in stimuli, and his 
solution is likely to appear to be insightful. 

The development of a learning set can be characterized as the emergence 
of a higher-order habit composed of the single common component rein- 
forced in a number of lower-order habits. Each discrimination problem can 
be looked upon as a lower-order habit which must be mastered. The only 
behavioral feature that is consistently reinforced from problem to problem 
is the habit of making a choice and changing the choice when the initial 
one is incorrect. The habit of choosing blue stimuli because of correlation 
with reward in one problem or square stimuli for another problem will 
only be reinforced part of the time. The only aspect of the set of discrimina- 
tions consistently reinforced is that of reversing the choice of stimuli when 

the initial one is incorrect and keeping with the initial choice when it is 
correct. 

Do learning sets occur outside discrimination-learning problems? As we 
shall see later, there is an analogy between learning abstract concepts and 
learning sets. Furthermore, probably many examples of learning how to 
learn are the result of emergence of higher-order habits. Moreover, even 
the relatively simple examples of learning how to learn illustrated by dis- 
crimination learning in monkeys involve such diverse factors as learning 
not to perseverate to the same stimulus from trial to trial and learning to 
respond to differential aspects of the cues (Harlow, 1950b). 

Teaching how to learn. We have seen that there is a general improve- 
ment in ability to learn that comes with practice at a particular kind of 
learning. There is more to it than this, however. Many kinds of remedial 
educational programs are built on this assumption, plus the supposition that 
special methods of teaching how to learn will produce even greater im- 
provement. 

In a classical experimental study Woodrow (1927) showed that specific 
instruction in how to memorize poetry, prose, and factual material resulted 
in greater improvement in ability to memorize than undirected practice. 
Woodrow stressed many of the techniques that are now standard in teach- 
ing students how to learn. He used active recitation, attention to meaning, 
techniques for maintaining continuous alertness, etc. Woodrow’s study was 
not analytical, and we cannot tell which of the factors was most important 
in improving memory. The important point, however, is that training for 

memory does have beneficial effects over and above those that come from 
practice alone. 
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Differentiation and Transfer 

In the section devoted to stimulus-response analysis and theory of trans- 
fer we pointed out that one important interpretation of stimulus and re- 
sponse relationships was that transfer was to a considerable extent mediated 
by differentiation. This was the view of Gibson ( 1941 ), and it has received 
considerable experimental attention (Young and Underwood, 1954, Under- 
wood, 1954a). One of the implications of this notion is that if we could 
induce differentiation among either the stimuli in a task or the responses 
without specific practice on the task or on the responses in it, we could 
produce positive transfer. 

A simple example of stimulus predifferentiation is illustrated in an ex- 
periment by Gagne and Baker (1950). These investigators gave subjects 
the task of moving a switch when a particular light came on. There were 
a number of lights located in different positions, and each light signaled a 
separate switch. Some of the subjects were given preliminary practice, not 
in moving the switches, but in merely assigning letters of the alphabet to 
each of the lights. This practice in associating the lights with names had an 
enormous influence on learning to associate the lights with the switches, 
even though the switches had nothing to do with the letter names. Those 
subjects that had no stimulus-predifferentiation training made about 7 
mistakes in their first 20 trials on the switch task. Those who had 32 stim- 
ulus-letter naming trials made only about 3 errors. 

This experiment was interpreted to mean that stimulus predifferentiaticn 
reduces the amount of stimulus generalization between the various stimuli 
so that in the final task, when these stimuli are associated with motor re- 
sponses, they are more distinctively different for the- subjects. 

There have been many similar experiments demonstrating essentially the 
same thing. Goss (1953), however, has pointed out that it is not entirely 
certain that the obtained transfer in these studies has been due to stimulus 
predifferentiation alone. He has further analyzed this problem and shown 
that while stimulus predifferentiation is indeed the main factor in the trans- 
fer effect, other things, such as warm-up, also contributed. In general, any 
kind of labeling or differentiation of stimuli will help in any subsequent task 
that demands different responses be attached to these stimuli. 

Transfer of verbal learning to the acquisition of a motor skill is common, 
and it very likely occurs principally because of the differentiation which 
preliminary verbal training allows. Battig (1956) reminds us that the 
learning of motor skills involves both discovery and performance. Discovery 
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means learning to differentiate between stimuli, to pick out their critical 

aspects, and to learn which ones are to be reacted to in a different manner. 

Performance means the actual motor responses which must be made. In 

many motor skills preliminary verbal training will suffice for learning the 

stimulus relationships. Battig showed that the amount of transfer from 

verbal pretraining to a motor skill varied inversely with the complexity of 

the motor activity required. When the motor, or response, side of the skill 

is relatively unimportant, verbal pretraining is of great value. If the motor 

aspect IS too complicated, there is little or no transfer to the motor skill. If 

these results are generally true for many different tasks, we may have an 

important principle for the application of verbal pretraining to learning 

skilled performance; the value of such training decreases as the motor com- 
plexity of the task increases. 

Transfer and Difficulty 

There are some rather surprising and complicated relationships between 
transfer and the relative difficulty of the tasks involved. In some cases trans- 
fer from an easy task to a hard one is greater, and sometimes the converse 
is true. Let us examine the latter case first. 

An experiment by Jones and Bilodeau (1952) illustrates the situation 
in which transfer is greater from a hard task to an easy one. Both tasks 
these investigators used involved coordinating the two hands to keep an 
indicator on a moving target. The difference between the two tasks was in 
the complexity of the movement of the target. In the hard task, the target 
moved in a very complicated path; in the easy one it moved in a relatively 
simple path. With the use of several groups of subjects, the investigators 
studied the transfer from the hard path to the easy one and vice versa. 
There was very great transfer from the complicated path to the easy one; 
as a matter of fact the transfer in this case was equivalent to direct prac- 
tice on the easy task. There was also positive transfer from the easy task 
to the complicated one, but not nearly as much. 

In the particular situation studied by these investigators, the harder task 
involved all of the components of the easy task and permitted greater 
practice. The easy task had only some of the same response components as 
the difficult task. Therefore the transfer, while positive, was somewhat less. 

In general, this will be true; in tasks in which more difficult versions con- 
tain all of the stimulus-response elements in simpler versions plus additional 
ones, positive transfer may be large, indeed, as large as the effects of direct 
practice in instances where no negative transfer from particular stimulus- 
response elements occurs. 
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In other situations, however, it seems that preliminar>' practice on easy 
tasks produces greater transter than preliminary practice on difficult ma- 
terial. This was the case in a study of discrimination learning in rats (Law- 
rence, 1952). The rats were trained to discriminate between the brightness 
levels of two goal boxes. In one condition the animals received all of their 
training on a difficult problem, that is to say, a problem in w'hich the bright- 
ness levels were nearly the same. In other conditions the animals received 
preliminary training on problems in which the brightness levels of the two 
boxes were cjuite different, so that discrimination was easy. All animals 
were eventually tested on the difficult discrimination. 

The results, rather surprisingly, were that ail the animals trained on 
easier problems did better than the animals trained on the difficult problem. 
This result is probably the result of factors much like those responsible for 
the stimulus-prcdilferentiation effect. 1 raining on the easier discrimination 
allowed the animals to discover more readily the critical difference be- 
tween the two boxes and pay attention to it and to no other stimulus con- 
ditions. Thus, when these animals were transferred to the more difficult 
discrimination they were alerted to brightness differences and found it easy 
to solve the problem. Animals trained all along on the difficult discrimina- 
tion, however, found it difficult to solve the problem because the small 
brightness difference between the two boxes was never brought directly to 
their attention. 

Thus in cases in which difficult problems involve stimulus-response com- 
ponents identical to those in easier tasks (but also additional stimulus- 
response components) there will be more transfer from the difficult prob- 
lem than from the easier one. In those cases in which the problems involve 
the isolation of relevant stimulus dimensions and the association of certain 
stimuli with reward or solution of a problem, training on easy problems will 
transfer more than training on difficult problems. 

These two different cases have not been completely explored, but it is 
likely that the principles in them can be generalized to many different tasks. 
If so, we have some important rules about the application of transfer of 
training to problems of varying difficulty. 

POSITIVE TRANSFER IN SCHOOL LEARNING 

We have already seen that transfer of training has been one of the prob- 
lems of major concern to educational psychologists. We shall not look at 
any of the practical problems of the applications of transfer principles to 
the school situation, but we shall look at some of the more general ques- 
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tions that have come from the study of transfer in school. One of the most 
important of these is training for transfer. 

Training for Transfer 

One of the basic problems in teaching is to maximize the amount of 
transfer. We have seen that transfer from one task to another may not be 
very large; consequently educational psychologists have looked for ways 
to increase the amount of transfer. Mostly this has meant that psychol- 
ogists working in the school situation have tried out various methods for 
making transfer greater between academic subjects, or an academic situa- 
tion, and some practical problem. An example will show the sort of things 
educators have been concerned about in transfer. 

An example of training for transfer. It is generally accepted that a for- 
mal academic discipline like high school geometry provides some transfer 
to reflective thinking in general. In some circles this is regarded as a hold- 
over from the notion of formal discipline, but, of course, such transfer does 
occur. The postulational technique demonstrated in geometry does general- 
ize to other things, and students are sometimes able to see this. The late 
Clark Hull, as a matter of fact, said in his autobiography that his interest 
in scientific theory generally came from his high school days and the study 
of geometry. 

In the main, however, transfer from a traditional geometry course to 
other kinds of situations involving reasoning is disappointingly small. Ulmer 
(1939) performed an experiment in an effort to see what could be done to 
improve the amount of transfer by designing a special course in high school 
geometry for transfer. 

Ulmer used three large groups of high school students in his study. One 
group served as a control. A second group of students was taught ge- 
ometry by the traditional methods. A third group was taught geometry 
through a program designed to arouse critical thinking both connected and 
unconnected with geometry. The teachers for the students in this group 
used a special manual designed to extract as much training in general criti- 
cal thinking from geometry as possible. The results of this study were very 
clear-cut. On tests of syllogistic reasoning and in the detailed records of 
teachers’ log books it was clear that the students in the third group were 
able to transfer much more adequately the basic principles of logical 

thought as exemplified in geometry to the tests of reasoning than the group 
trained in the traditional method. 

What are the conditions of training for transfer? Ulmer’s study gives 
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impressive evidence for the possibilities of training for transfer, but it does 
not tell us about the nature of training for transfer. We cannot know, from 
the available evidence, the complete answer to this question, but we do 
know enough to be able to see some of the more important factors. 

One paramount factor is the control of generalization. We saw in studies 
of animal learning and human rote verbal learning that generalization is an 
essential element in both positive and negative transfer. We also saw in the 
experiments on the formation of learning sets that experience at multiple- 
task learning gives learners direct control over generalization. That is to 
say organisms learn to vary generalization voluntarily. Variation in gen- 
eralization is learned as a critical element in problem-solving behavior, and 
the sophisticated learner (be it rat or man) varies the extent to which be- 
havior is generalized in accordance with other things learned about the 
testing situation. Thus the monkeys in the learning-set experiments learned 
to test specific hypotheses about which stimuli would lead to reward, rather 
than to rely simply on the strongest generalization of the moment. 

This is extremely important, and we shall discuss it again in connection 
with problem solving. It should be clear in the present context, however, 
that broad gener^ization is essential for transfer. Unfortunately transfer is 
both positive and negative, depending on whether the generalized behavior 
is compatible with the requirements of a new situation or not. A higher- 
order behavior is achieved when the organism can decide whether or not 
to vary transfer based on its estimates of the relative probabUities and 

advantages of negative and positive transfer. 

There are numerous applications of particular techniques of training for 

transfer. Many can be found in basic textbooks in educational psychology 
(for example, see Frandsen, 1957). Such matters as unifying concepts in 
arithmetic and phonetics in spelling are all important questions of transfer 
in the elementary curriculum. Indeed, a good deal of the dispute over the 
use of the so-called “phonic” method in reading is basically concerned with 
whether this method produces mostly negative or mostly positive transfer 
for reading at higher levels. The opponents of the widespread use of this 
method have emphasized the potential sources of negative transfer. The 
proponents of the method have emphasized its wide generality, and hence 
by inference, its high potential for positive transfer. Since almost any rule 
or principle in language learning can potentially produce both negative and 
' positive transfer, much of the concern over different methods of teaching 
language skills involves the question of transfer. 

To emphasize the potentiality for negative transfer in language learning. 
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consider some of the erroneous definitions 
a vocabulary test (Johns, 1939); 


given by university students in 


Elucidation 

To get away from (“elude”) 

Elimination (from the sound) 

Coerce 

Speak, discuss (possibly from “converse”) 
Plan of study (from “course”) 

Innate 

Without life, dead (from “inanimate”) 

Silly (“inane”) 


Obviously, some training in the fundamentals of English 

vide enough control over generalization so that positive 
tive transfer would result. 


words would pro- 
rather than nega- 


J 


Ability and Transfer 


In the study of Ulmer on the teaching of geometry, students of aU levels 
o intellectual ability seemed to profit from the specialized methods of 
teaching designed to increase transfer. There was, however, a relationship 

u from transfer and intelligence. This relationship held 

both for the experimental condition, in which the specialized method of 
c ing was used, and for the traditional method of teaching geometry. 
There was always more transfer from geometry to improvement in syllo- 
gistic reasoning for mteUigent students than there was for average or in- 
fenor students. 


This relationship between transfer and intelligence is characteristic ol 
many studies. As a matter of fact, it almost looks as though the ability to 
recognize and make use of a general principle when it is avafiable is a basic 
actor in intelligence. Most of the educational implications of such a rela- 
tionship are quite obvious. There is one important implication that is per- 
haps not so immediately easy to see. Transfer by general principle may not 
work well with younger children because of the correlation between age 
and mental maturity and the dependence of transfer by general principle 
on mental maturity. In one study of second grade children, for example. 
It was found that simply showing the children that arithmetic procedures 
could be generalized >vas more productive of transfer than attempting to 

procedures (Overman, 1930-1931). These second 

cinip • ' mature enough to grasp the general prin- 

ciples involved. They had to be taught, more or less by rote, to apply the 


Jt 


TRANSFER OF TRAINING 


235 


rules that they had learned. As we might suppose, those children higher in 
mental ability profited more from the rationalization procedure. 

The relationship between transfer and mental maturity plays a consider- 
able role in an important educational concept, the concept of readiness. 
Educational psychologists have made much of the futility and, indeed, dan- 
ger inherent in trying to teach children skills before they are ready for 
them. In general this means that one should not try to enforce practice at 
a skill until certain basic functions have matured to a certain level. It is im- 
possible to teach most preschool children to write, because they have not 
yet the command over the fine muscular movements that are required in 
writing. Such children can only make rather large and clumsy movements 
with their fingers and arms. It is to be noted, however, that a considerable 
(though indeterminate) part of the maturity required for the ability to 
write comes from previous training. Scribbling, drawing, working with the 
paint brush, and indeed, a whole host of related activities probably all 
give the child an ability to work with the muscles of his hands and arms 

that later transfers to writing. 

Transfer and Specialized Training Programs 

The value of general education can exist quite apart from any specific 
transfer value assigned to parts of it. This is not true of most specialized 
training programs, however. If the Army or the Air Force invests time and 
money in training a man for a particular job, it wants to know whether this 
training has actually made him better for the job than he was before train- 
ing. In other words, in specialized training programs transfer is a critical 

element. 

In many applied problems in transfer the important problem is that of 
validity. We need to ask the question. Does training on a special device or 
by special techniques show positive transfer to the actual task? There are 
many questions subsidiary to this main question. For example, on-the-job 
training is usually very expensive. Will a preliminary training program not 
only transfer, but will it transfer enough to the actual task so that the 
training program is, in the long run, cheaper than on-the-job training? Such 
problems are of very real concern to the applied psychology of transfer, and 
they have received much attention (Lawrence, 1954; Tufts Institute, 1952). 
It is too early to hope that the basic experimental work on transfer will 
allow us to predict well enough to answer these questions without a direct 
test in the applied situation, but it is not too much to hope that we shall be 
able to make fruitful guesses about which applied training programs will 
profit most from a revamping in view of the basic principles of transfer. 


CHAPTER 10 


RETENTION AND 



There are two ways of looking at the problems of learning. We can examine 
the way m which behavior changes as a function of experience; ordinarily 
this means that we view improvement in performance as the result of prac- 
tice. We can also take a fixed level of performance and see how that per- 
formance IS retained over an intervening time interval. This is the method 
we shall consider in this chapter. Both of these approaches are, of course, 
simply different views of the same problem. If there were no retention, there 
could be no learning, since the gradual increase, or improvement, that 
occurs with practice is based upon residual benefits from previous practice. 
This suggests that the factors underlying retention are the same as those 
underlying learning and, conversely, that the factors underlying forgetting 

are the same as those which tend to slow down the learning process. If this 
is so, why a special study of retention? 

The answer to this question is that such a study allows us to examine 
changes in behavior uncontaminated by further practice or exposure to 
learning materials. When we have determined to what extent and in what 
manner something is retained, we have a measure that tells us how mental 
processes reorganize material already learned without additional exposure 
to the material. From a study of this process of reorganization we can learn 
something about the mental processes themselves. 

Before we begin an examination of retention, we need to define the two 
words retention and forgetting. Retention refers to the extent to which ma- 
terial originally learned is still retained, and forgetting refers to the portion 
lost. Therefore 

Amount forgotten = amount learned — amount retained 

236 
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Amount retained is always measured directly, and amount forgotten is ob- 
tained by subtraction. Since amount retained is the direct measure, it is 
the measure we prefer to use. It is useful, however, to talk about forgetting 
when we are considering those conditions which tend to reduce the amount 
retained, and this chapter is therefore separated into two sections — one 
called Retention and one called Mechanisms of Forgetting. In the second 
section, emphasis will be on forgetting as an active process, and we shall 
discuss those experiments in which there is a deliberate attempt to reduce 
the amount of material people will be able to retain. In the section on 
retention we shall examine studies of the amount and the type of things 
people remember after various intervals of time. 

RETENTION 

Retention was one of the first aspects of human learning and verbal be- 
havior to be studied by laboratory methods. Ebbinghaus studied retention as 
part of a sustained attack on the “higher mental processes.” The results of 
his studies were published in a monograph in 1885 called Uber das Ge^ 
ddchtnis, or Concerning Memory. This monograph is a landmark in experi- 
mental psychology, not only because it reports important findings but 
because it represents the first experimental study of the problems of human 
verbal behavior. This opened up the whole field of experimental investiga- 
tion of thought, memory, and verbal learning. 

One of the problems which Ebbinghaus studied was how much can be 
remembered after a certain time interval has elapsed since learning. It is 
rather surprising that no one had ever thought to do this before, even in a 
casual way, but the fact is that Ebbinghaus’s results represent the first really 
quantitative data on human memory. Before we look at his findings, how- 
ever, and those of his successors, it will be wise to examine some of the 
methodological problems in the study of retention. 

The Measurement of Retention 

Experimental Designs 

Experimental methods in the study of retention fall into two classes. In 
one class the same subjects are tested for retention of the same material 
several times. An experimental study of this sort yields very impure meas- 
ures of retention after the first test, since all subsequent measures are con- 
taminated by the practice the first test allows. As a practical matter, how- 
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ever, many studies of retention of school subjects have been carried out in 
this way. 

A more adequate approach allows only one test of retention after leam- 
mg. This can be accomplished by ( 1 ) repeated measurements on the same 
subjects, using different materials for each test, and (2) independent meas- 
urements of different subjects, using the same material. In the first method, 
the subjects may learn one set of nonsense syllables and be tested for re- 
ention after an hour, then learn another set of nonsense syllables of equiva- 
ent difficulty and be tested after two hours, etc. In the second method, the 

subjects are divided into several groups, each of which is tested after vary- 
ing periods of time. ^ 

There are basic and profound differences between these two methods, as 
we shall see later in this chapter. Ebbinghaus and many subsequent investi- 
gators used the first method, but modern investigators prefer the second 

because it eliminates possible interaction between materials learned at 
clifierent times. 

If either the same subjects remembering different material or different 
subjects remembermg the same material are to be compared, we must have 
some means of equating learning. This is generally done by using the same 
criterion of learning in all cases. Suppose that we want to find out how many 
nonsense syllables subjects can remember after 24 hours compared with 
the number they remember after 1 hour. We have two groups of subjects, 
both of which learn 15 nonsense syllables. We adopt a fixed criterion. 

sually this means that we stop practice for each subject after he has 
correctly anticipated all 15 nonsense syllables twice in succession. The sub- 
jects may, of course, be given different amounts of practice before a test 
or retention, but the number of items overtly learned is held constant, 
ometimes this is undesirable, and we adopt instead some criterion of num- 
ber of trials. Thus we may give all subjects in the two groups, say, 10 trials 
of practice. In this case we hope that, on the average, the individual subjects 
in the two groups will have learned about the same number of items, though 
we actually allow the number of items overtly learned to vary freely. 

In some cases we perform experiments in which there is no clear separa- 
tion between learning and retention. Many experiments on such dynamic 
factors as motivation in learning and retention are of this sort. The simplest 
and most common example is the measurement of the “immediate memory 
span. The immediate memory span is simply the number (and kind) of 
separate items an individual can remember after only one exposure. Thus 
we say the following series of numbers to someone, 4-7-2-6-3-8, and ask 
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him to repeat them, we are employing the procedure used in measuring 
the immediate memory span. We shall see the importance of this technique 
from examples given later in the chapter. Its limitation is that it does 

allow a clear separation of learning and retention. 

Measures of Retention 

The measurement of retention is not so simple as it sounds. There are a 
number of different measures, and they do not always even approximate 
one another. We are justified in calling them all measures of retention in 
that they tend to be correlated, though some are much more sensitive than 

others, and on occasion, may reveal quite different things. 

Direc, measures. The simplest kind of retention measurement is a direct 

measure of performance. In many cases this simply means giving the sub- 
ject one more learning trial. We may, for example, tram a rat to run a maze 
by giving it 1 trial a day for 10 days. We then may “test for retention by 
giving one more trial a month later. There may be no difference in the 
method of administering the trials; in the retention test, however, the 
emphasis is on the effects of the interval of time between the next to the last 
and the last trial. Thus we may compare retention after 30 days with re- 

tention after 1 day. 

In verbal learning we may use an exactly analogous technique. We may 
ask subjects to learn a list of nonsense syllables by serial anticipation and 
test retention simply by presenting the list once more. In this test, the sub- 
jects must anticipate the next item. In verbal learning, however, we may also 
make use of a somewhat different method of directly testing retention, which 
minimizes the support the subjects receive from the context of what they 
have learned. This is the method of recall. After we have had subjects prac- 
tice a list of nonsense syllables by serial anticipation, we may ask them 
simply to recall as many of the items in the list as they can. This eliminates 
much of the context provided by response-stimulus chaining in serial antici- 
pation. The result is that such recall scores are generally lower than the 

anticipation or relearning scores. 

Both recall and relearning scores may be expressed as a percentage of 
the number of items correct on the previous trial, which, if the subject 
achieves the ultimate criterion, will be all of the items. Thus if we ask a 
subject to learn a list of 20 syllables, and at recall he is able to recall only 
five of them, his recall, or retention score, is 25 per cent. 

Savings. The direct methods suffer from a lack of sensitivity. It is pos- 
sible, for example, that a subject may not be able to recall any nonsense 
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syllables at a test of retention, and his retention score is zero. Does this 

mean, then, that no effect of the previous learning remains? The answer is 

no, since a more sensitive method of probing may reveal some residual 
effects. 

The method of savings is a much more sensitive method for quantitative 

tests of retention. It may reveal retention where none seems to exist by 

straight recall, and, indeed, it may reveal “negative” retention — it can show 

that an individual recalls less than if he had never practiced the material 
being tested. 

As in the direct method, the subject is asked to practice an additional 

number of trials on what he has already practiced. The measure, however, 

is the percentage of time or trials saved compared with the original learning. 
It is given by the formula 

Per cent saved = No. trials to l eam - No. trials to relearn 

No. trials to learn 

Thus if it took 25 trials to memorize 15 nonsense syllables and 5 trials to 

relearn them after an interval of 1 hour, there remains a savings of 
80 per cent. 

Other methods. There are many variations of either the direct or the 
savings method. In addition, there is another important class of measures 
which can be summarized under the term recognition methods. These are 
very familiar, since one example is the multiple-choice test (invented by 
Ebbinghaus, by the way). This method has greater sensitivity than the direct 
measure, since it relies most fully on context. This is precisely its weakness 
for experimental work, however. It is furthermore dependent upon various 
artifacts of testing. If we simply ask someone whether or not he recognizes 
what we present to him, we run the risk of unreliable and false judgment. 

If we more strictly test an individual by asking him to pick out which of 
several items is correct, the probability of a correct selection on the part 
of the subject is at the mercy of the particular choices he has to make. We 
can vary the difficulty of an item simply by varying the incorrect alterna- 
tives. If we do this in a controlled manner, we may have a technique that 
is instructive. For most purposes, however, the method of recognition is lim- 
ited to the applied problems of retention of school material and to the study 
of perceptual memory. 

The study of perceptual memory sometimes introduces another tech- 
nique, that of reproduction. This is a variant of the method of recall, in 
which an individual is asked to draw or otherwise reproduce something he 
has previously seen. As we shall see shortly, this method has likewise proved 
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to be very instructive. It is, however, much at the mercy of response factors 
(ability to draw, for example) and variability in the judgments necessary 

to scoring. 

The Nature of Retention 

We can now turn to studies of the course and nature of retention. In 
general, there are two things which investigators have been concerned 
about: (1) quantitative changes in the amount remembered with increasing 
time since learning, and (2) qualitative changes and reorganization of the 
material originally learned. We shall deal with the question of quantitative 

changes in memory first. 

The Course of Retention 

Let us look first at the studies of Ebbinghaus on the course of retention. 
He was much influenced by the notion that the elements of mental activity 
come in discrete packets called "ideas. ’ The basic problem in the study of 
mental activity was how different ideas became associated and then disasso- 
ciated. Ebbinghaus wanted to study the association of ideas in its simplest 
form, that in which completely new ideas are formed by the subject and the 
course of their retention examined by the experimenter. Hence the inven- 
tion of the nonsense syllable. 

We do not need to accept Ebbinghaus’s devotion to the classical theory 
of the association of ideas, but one result of his views was that he studied 
retention of material in which, at the outset of the experiment, there was 
little or no associative value. In the light of our discussion about the effects 
of statistical dependency in material to be learned, we can see that his ma- 
terial was very difficult to learn and remember. 

The results of Ebbinghaus’s measurements of retention are shown in Fig- 
ure 48. The most important feature of this curve is its extremely rapid fall. 
Most forgetting, or loss of retention, occurs in the first few hours after 
original learning. 

The curve shown in Figure 48 has frequently been characterized as the 
typical retention curve, which in one sense it is. There are several important 
things to keep in mind about this curve, however. For one thing it was ob- 
tained with one highly practiced subject — Ebbinghaus himself. Anotlier 
important thing to observe about this curve is that it is based on the savings 
measure, not simple recall. 

In the older psychological literature, one may find many examples of the 
measurement, by various methods, of the amount retained over various pe- 
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riods of time and for various kinds of learning material. In general all these 
examples agree with Ebbinghaus’s results in one respect — decline in reten- 
tion is negatively accelerated (McGeoch, 1942; McGeoch and Irion, 1952). 
This seems to be true for the measurement of the retention of nonsense 



LOG MINUTES SINCE ORIGINAL LEARNING 

Figure 48. Retention (in per cent saved at relearning) as o function of time v/ithout practice. 
Notice that the curve is nearly a straight line plotted against log time. This means that most of for- 
getting took place in the first 24 hours. (Data from Ebhinghavs, 1885.) 


syllables by other methods, the retention of meaningful material, and the re- 
tention of psychomotor skiUs. Of course, after special conditions of massed 
practice reminiscence may occur and lead to a temporary rise in the curve of 
retention. Under well-distributed practice, however, negative acceleration of 
the retention curve seems to be the general rule. 

The rate and limit of forgetting is determined by many factors. The greater 
the degree of organization in material learned, the slower the rate of for- 
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getting and the higher the limit of retention. This in part is because of the 
greater redundancy of highly organized material. We know that when an 
individual is required to remember more material within a given context, 
there is lower retention measured by savings (Newman, 1939). Therefore, 
higher redundancy produces better retention principally because of the 
smaller number of things to be remembered. By the same token, memory 
for ideas in a prose passage is better than memory for factual detail (Briggs 

and Reed, 1943) . 

Another important determiner of the rate and extent of forgetting is 
amount of practice. It is well established that amount of retention is roughly 
proportional to amount of original practice (Ebbinghaus, 1885; Krueger, 
1929). This relationship is sometimes expressed by saying that overlearning 
is beneficial to retention. Overlearning means continuing practice after some 
material has already been completely mastered. In general this point is well 
taken; practice beyond the stage at which an individual reaches a criterion 
of 100 per cent correct responses does improve retention, measured either 
by recall or savings. A more thorough study of this problem is needed, 
however. It is unlikely that practice beyond the criterion of one errorless 
trial would indefinitely continue to increase retention and that the relation- 
ship between practice and overlearning would be the same for all kinds of 
material. At present, however, we are not able to state clearly the limits of 

this relationship. 

Retention and Speed of Learning 

Most psychological textbooks, when discussing retention, attempt to 
“correct” a common misconception by pointing out that there is a positive 
relationship between speed of learning and amount retained. That is to say, 
contrary to folklore, which holds that someone who learns slowly retains 
well, the experimental literature apparently demonstrates that people who 
learn slowly retain comparatively little. 

A paper by Underwood (1954b), which considers the problems of meth- 
ods of measurement in comparing individual differences in speed of learning 
and retention, shows that the question is not so simple as it first appears. 
Underwood analyzed many data on rote serial learning, originally collected 
for other purposes. These, like earlier data, showed that there was nearly a 
zero correlation between time to learn and amount recalled; there was a 
high positive correlation, however, between time to learn and time to re- 
learn, Underwood further analyzed the problem by finding a number of 
items which were anticipated an equal number of times during learning by 
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fast and slow learners. On the average, of course, fast learners anticipated 
more frequently than slow learners, but it was possible to find some items 
in which the anticipations for fast and slow learners were equal. Even for 
these items, however, retention was better for fast learners. 

This means, in effect, that any given correct anticipation strengthens an 
association more for a fast learner than for a slow learner. If this is so, de- 
gree of learning of the material is not equal before introduction of the re- 
tention interval. With this in mind, Underwood adjusted the association 
strength of items for fast and slow learners. When this was done, there was 
no difference in retention between the two groups. 

The net effect of this analysis is to show that very likely the critical dif- 
ference between fast and slow learners is simply the associative strength of 
particular items at the end of learning; fast learners have acquired more 
associative strength for particular items. It does not mean that fast learners 
have an inherent ability to remember things; it simply means that in a given 
amount of practice they generally acquire greater associative strength for 
the things they are learning. Perhaps they have more efficient means of 
coding (see below). 

Reorganization in Retention 

There is a good deal more to the problem of retention than simply 
whether or not responses are retained; they are reorganized in accordance 
with the individual’s previous habits, motives, and other unique character- 
istics, Such reorganization is perhaps the most fundamental aspect of re- 
tention and we shall want to pay considerable attention to it, both in this 
and subsequent sections. 

Clustering. One of the clearest and most important examples of reor- 
ganization of verbal material in memory occurs in the phenomenon of c/mj- 
tering discovered by Bousfield and his associates. Clustering can be demon- 
strated by the following experiment (Bousfield, 1953): Subjects listen to 
a list of 60 words. These fall into four categories — animals, names, pro- 
fessions, and vegetables. There are 15 words in each category. The 60 
words are presented in random order, so that words from the same category 
occur together only by chance. Immediately afterwards the subjects write 
down, in order, all of the words they are able to recall. The two important 
results of this experiment were: (1) the subjects recalled more words than 
they would with lists of random words, and (2) they tended to recall words 
in the same category together. 

As we have seen, in free recall of unorganized material subjects tend to 
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recall the last items of a list first and the remaining items in order of de- 
creasing associative strength (Deese and Kaufman, 1957). When items can 
be categorized by the subjects, however, they reorganize the list so that they 
tend to give similar items together. 

Bousfield points out that one of the principal determiners of whether or 
not a subject will recall a particular word in a list is the subject’s previous 
experience with it. If it is a word that has occurred frequently in the sub- 
ject’s past it is more likely to be recalled than if it has occurred very infre- 
quently.^ This simple law of habit strength cannot account for clustering, 
however. Bousfield and Cohen ( 1953) have emphasized the importance 
of a second determiner of clustering. This they call a relatedness increment. 
They believe that words related to one another facilitate one another in 
recall by raising in strength a superordinate category of w'ords to which all 
the words in a given category belong. The superordinale category is ac- 
tivated by a single high-strength word from the list; the category in turn 
then arouses the low-strength words in the list. For example, the occurrence 
of the w'ord "leopard” activates the mediated response "animal," which in 
turn arouses the other subordinate members of this class (Bousfield and 
Cohen, 1955). 

The important point about this notion is that the mechanism of clustering 
is not a direct matter of association. Rather it seems to be the result of an 
intervening mediational mechanism which is aroused by words on the list. 
Such a mechanism not only codes items presented to the subject in such a 
way that he can more easily recall them, but it reorganizes the order in 
which items are emitted in recall. 

Number of categories in recall. An important problem in clustering 
concerns the number of categories into which the items on a test of recall 
may be classified. If one holds the length of the list constant and varies the 
number of categories, the number of items in a category decreases. What 
happens to recall under variation in the number of categories? One might 
expect that recall would decrease as the number of categories increased be- 
cause too large a number of categories might exceed the immediate memory 
span (which is about seven items for adults). On the other hand, if the 
number of categories is reduced too far, there might be too many items in 
any one category for the immediate memory span, even taking into account 

iThe relationship between frequency of experience and habit or association 

f 'I * is to say, a word that is extremely 

familiar is not much stronger than one that is only moderately familiar, but a mod- 
erately familiar word is very much stronger than an unfamiliar word. 
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the advantages of coding. Perhaps one would expect some optimum num- 
ber of categories for a list of a given length. 

Actually, there has been little experimental work on this point. Mathews 
( 1954) has studied frequency of recall for lists 24 items in length composed 
of two, three, and six categories, which in this case were composed of names 
of artists, athletes, musicians, etc. The experiment showed that recall in- 
creased slightly as the number of categories increased from two to six. 
This is perhaps what one would expect since six categories are not too great 
a burden on the immediate memory span (assuming that the subject could 
remember one name from each category). The names themselves, however, 
perhaps have low associative strength for any particular mediating superor- 
dinate, with the result that as the number of names gets too large and the 
number of categories gets too small, the probability of arousing a good 
superordinate is reduced. 

The net result of all this is to suggest that we should be extremely cau- 
tious in generalizing the results on this point from any one limited experi- 
ment. If we knew more about the ways in which people formed their own 
mediating responses we could predict somewhat more adequately the spe- 
cific effects of number and kinds of categories, but at present there seems 
to be no way of finding out about these. 

The immediate memory span and recoding. We have mentioned im- 
mediate memory span frequently in the last few pages. It is of fundamental 
importance in the understanding of the organization of retention, and it 
deserves some attention in its own right. 

The immediate memory span is the number of items of a particular kind 
which can be accurately reproduced after a single exposure. The unmediate 
memory span for digits turns up as a standard item on nearly every intelli- 
gence test, and the norms for such tests tell us that the average adult is able 
to reproduce six or seven items accurately 50 per cent of the time. The 
immediate memory span is smaller with children and indeed, the fact that it 
regularly increases with age is one of the reasons it has a place in intelli- 
gence testing. 

The size of the immediate memory span (in number of items correctly 
reproduced) varies with the material used for testing. For example, it is 
somewhat smaller for unrelated monosyllabic words than it is for decimal 
digits. Now this fact might lead one to conclude that the number of items 
of a particular kind that an individual can remember after just one hearing 
is determined by the amount of information per item in the testing material. 
Miller ( 1956b) has explored this hypothesis and found that it does not work. 


RETENTION AND FORGETTING 


247 


We can think of the amount of information as the amount of uncertainty 
in any particular item about which we have to guess. Thus, if I ask someone 
to guess what word beginning with “A" I am thinking about, his uncertainty 
will be high, because there are so many words beginning with “A.” If I ask 
him to guess what number between 1 and 10 I am thinking about, his uncer- 
tainty will still be high but not quite so high, since this time there are only 
10 alternatives. If I ask him to guess which hand has the coin, the uncer- 
tainty is still less, since there are only two alternatives. 

Information is an important concept in communication systems and com- 
puting devices, and when we say that a given communication system or 
computer has a given informational capacity, we mean roughly that it can 
reduce uncertainty about alternative answers to a problem only at a particu- 
lar rate. 

If our memory storage system has a limited capacity to take in informa- 
tion, then we should be able to remember more binary digits - than decimal 
digits and more decimal digits than words in a test of immediate memory. 
This is because there is less information in each binary digit than in each 
decimal digit and there is less information in each decimal digit than in each 
word. 

We said a moment ago, however, that things are not that simple. Indeed, 
while our memory span for biitary digits is in general larger than for deci- 
mal digits, and our memory span for decimal digits larger than for words, 
there is not nearly enough difference between them to make information 
recalled a constant. Indeed, we can remember only one or at the most two 
more decimal digits than monosyllabic words. Thus our ability to take in 
material and store it momentarily is not determined by the amount of infor- 
mation in the material. What then does determine the limits of immediate 
memory? 

Miller (1956b) has made a start at answering this question. First of all, 
he says, wo have to accept the notion that what people remember in the 
immediate memory span is chunks of things in which information has 
been encoded in various ways. For things like words, decimal digits, and 
indeed, any discrete series of items, there is not much difference in the num- 
ber of items that can be recalled immediately. Therefore, it looks as though 
people remember a constant number of chunks or items irrespective of the 
amount of information in each item. Miller reminds us that if an individual 
has an immediate memory span of five words, we can guess that he has an 
immediate memory span of 15 or 20 phonemes, since each word, on the 

2 Binary digits are numbers in which only two digits are used, zero and one. 
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average, will have three or four phonemes. But, says Miller, people do not 
code by phonemes; they code by words, ideas, and sentences. Thus if peo- 
ple can remember an average of two sentences of simple prose, these sen- 
tences represent information coded into five, six, or seven chunks. 

This process is called recoding. We hear something that has many chunks 
(say, words in a sentence) with relatively few pieces of information per 
chunk. We can take this sentence and recode it into fewer chunks (just the 
size to fit the immediate memory span) with more information per chunk. 
Thus we can get the idea” out of a sentence. We do not know exactly what 
this recoding entails in every instance, but frequently it is probably a way 

of grouping the pieces of incoming information by applying some familiar 
name to the groups or pieces. 

Miller reports an experimental attempt by one of his associates, Sidney 
Smith, to recode binary digits in an efficient manner. By regrouping the 
binary digits into octal units ® Smith could remember 40 binary numbers 
after a single hearing. How many items out of a sequence like this can 
you remember after one reading: 010110100100110101101011011100- 
1111010010? Perhaps as you read through the series you could find a 
way of recoding these. Unpracticed people would probably try reorganiz- 
ing by threes and read the sequence like this: 010-110-100-100-110, etc. 
This is not quite so efficient as recoding by assigning names to the pos- 
sible groups of three. If you practice assigning names, you should be able to 
bring your immediate memory span for binary digits up to 36. 

The point Miller wants to make is that memorizing of new material, the 
piling up, or storage, of additional verbal items in memory, is probably to 
a considerable extent simply the formation of chunks, or the reorganiza- 
tion of items into new groups. Finally, when we have reorganized the ma- 
terial into a small enough number of chunks we can recall all the material. 
Nonsense materials are harder to memorize than sensible verbal material 
simply because the latter comes already organized into chunks, whereas we 
ourselves have to form chunks for the nonsense material. 

Clustering, the serial reorganization of material with statistical depend- 
ency, and related effects are probably all ways of reorganizing material 
into larger and larger chunks. These chunks most likely represent some 
mediational process, and we shall have more to say about this when we 
come to a discussion of problem solving and thinking. 

3 Binary digits coded by assigning numbers 0 through 7 to groups of three. Thus 101 
would be coded as “5” and 100 as “4.” 
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Perceptual Memory 

Let us now turn briefly to some problems in perceptual memory. Memory 
for visual forms, particularly abstract visual forms like circles, squares, and 
combinations of these, has been the special province of gestalt psychology. 

The gestalt approach to perceptual memory. Gestalt psychologists have 
generally taken the view that there are “fields of organization” in perception 
and memory for perceptual events and that these are related to “physiologi- 
cal fields” in the cerebral cortex. They have considered the “memory trace” 
for a perceptual event to be autonomously altered by events intrinsic to the 
brain. In investigating the implications of these notions, they have studied 
the perception of figures in great detail and stated some rather simple rules 
that are supposed to govern the nature of changes in the memory traces for 
visual forms. 

In general, gestalt psychologists have supposed that the memory traces 
for forms become better structured over a retention interval. Memories of 
forms are supposed to become simpler than the original perception. Experi- 
mental studies have not always confirmed the specific predictions various 
gestalt psychologists have made about the direction and extent of the 
changes in forms during a retention interval, and the actual events seem to 
be more complicated than gestalt notions allow. 

There is one point which the gestalt psychologists have insisted upon, 
over the frequently vocal objections of more associationistic psychologists, 
and in this they have turned out to be absolutely right. This is the general 
proposition that there are systematic reorganizations going on during the 
retention interval. The memory trace for a perceptual event is not just a 
gradually fading picture of something we have seen in the past; it is some- 
thing that is being actively reorganized and reconstructed. 

It is likely that this reconstruction is much the same process as that we 
have just examined in the context of verbal retention. In other words, it is 
a reorganization that has as its roots certain recoding processes made pos- 
sible by the fact that our perceptual world obeys probabilistic rules and that 
we have all had experience with these rules. Some of the most interesting 
studies on the way in which these recoding processes work have come from 
the classical investigations of Bartlett (1932), and we shall now look at 
some of his more important conclusions. 

Bartlett’s studies. First of all, Bartlett points out, when we are faced 
with an unfamUiar form, we do not just try to receive an impression of it 
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as it stands. We try to reconstruct it in such a way that it is easy to assimi- 
late into our past experience with the world. There is, in Bartlett’s words, 
an effort after meaning. This may be so simple as to be merely giving a 
name to the figure. Unless the figure is very simple and almost completely 
specified by the name, the figure will be schematized in memory. For exam- 
ple, the form in Figure 49 was frequently labeled by subjects in a percep- 
tual-memory experiment as a “dog’s head” (Deese, 1956). Such a designa- 
tion will surely influence the way in which the form is retained. 

Bartlett gives us several examples of the transformation of visual form in 
memory by the operation of schematic recoding. One of the best-known 
examples is reproduced in Figure 50. In this experiment the original draw- 
ing in the upper left-hand comer was presented to a subject. He was asked 
to reproduce this drawing from memory. The result is reproduction 1. This 
reproduction was then presented to another subject, and this subject was 
asked to reproduce the figure. We can clearly see how the form was trans- 
formed by the various schemata adopted by the different subjects. In gen- 
eral it is changed from the conventional ancient Egyptian symbol for “owl” 
to a picture of a sitting cat. 

Bartlett describes at great length the various ways in which perception 
itself and remembering are transformed by schematization. The schematiza- 
tion partly depends upon the redundancy in visual forms (Attneave, 1954), 
and schematization based upon redundancy is likely to produce transforma- 
tions in memory in the direction of simple, more symmetrical forms. There- 
fore it seems likely that some of the gestalt laws of perception and perceptual 
memory have as their basis the redundancy that exists in most forms of our 



Figure 49. A ''nonsense'' form generally recoded by labeling when individuals are osked to 
Temember the form. The most frequent name assigned this form by a group of subjects wos "dog's 
head." (Deese, 1956.) 
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Figure 50. A form &erially reproduced from memory by different subjects. The form is altered at 
each reproduction. The influence of labeling as a coding device is seen in the change from a con* 
ventional Egyptian symbol for an owl to a picture of a cat. {After Bortlett, 1932.) 


experience.^ Redundancy, of course, means that some of the information pre- 
sented in a visual form is superfluous; thus we can recognize a well-known 
person from a caricature of a few lines about as well as we can from a 
photograph. 

Ways of schematizing. We have already mentioned some of the ways 
in which forms may be schematized for visual memory. They may have 
names attached to them or be reconstructed by some simple principle of 
design, particularly if there is high redundancy in them. Actually the ways 
in which schematizing may occur are numerous, and it will only be worth- 
while to illustrate a few. 

Simple forms may be reproduced even if they cannot be well schematized, 
much as a list of random digits may be reproduced in the immediate mem- 
ory span. If nonsense forms become too complicated, however, they must 
be learned by rote, and the length of time this takes is about the same as for 
any other kind of nonsense material with an equal number of items (Pieron, 
1920). The determinants of memory for any set of perceptual events, how- 
ever, are quite complicated. 

For example, if subjects are required to recognize rather than reproduce 
forms, recoding is likely to be quite different. They may recode by naming, 
but if names are difficult to assign to the forms, they may try to identify 
them by some isolated segment, characteristic of a form. If recoding is done 
this way it may turn out that a complicated form will be easier to recognize 
than a simple one, because the former will have a greater probability of 

^ See, for example, Attneave (1955) on symmetry. 
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having some really distinctive, isolated segment (Deese, 1956). On the 
other hand, if the subjects are required to view too many different forms, 
recognition for simple forms may be better than for complicated forms be- 
cause the number of distinctive segments the subject may have to try to retain 
for the complicated figures exceeds the immediate memory span. 

The importance of the limitation of recoding is illustrated in an experi- 
ment by French (1954). This investigator had subjects identify patterns of 
dots from memory. The subjects were required to learn a name associated 
with each pattern of dots, which were arranged randomly so that recoding 
by familiar names could occur only by chance. The results were that sub- 
jects could more easily recognize patterns with six to eight dots than they 
could patterns with fewer or more dots. Ease of identification was asso- 
ciated with patterns of dots arranged either symmetrically or in linear 
arrays. In other words there was better perceptual organization in the pat- 
terns of six to eight dots. With fewer dots there was less likelihood of organ- 
ization (and hence, of recoding), and with more dots the patterns became 
too cluttered. 

The ability to recode our visual experience, for one thing, appears to be 
a function of the frequency of exposure to certain forms. In one experiment 
(Amoult, 1956) subjects were exposed to nonsense forms a varying num- 
ber of times. These same subjects were later asked to rate these forms for 
familiarity. There was positive relationship between the frequency of ex- 
posure and familiarity, much as there is with unfamiliar words (Noble, 
1954). Furthermore there were positive relationships between the recog- 
nizability of the forms and their frequency of presentation and rated famil- 
iarity. 

Dynamic Factors in Memory 

If one reads enough books in psychology he will get used to seeing the 
phrase dynamic factors. Generally this refers to individual differences in 
personality, motivation, emotionality, and related factors. Nearly all psy- 
chologists assume that such factors influence memory. Sometimes particu- 
lar theories are quite specific about the ways in which dynamic mechanisms 
may influence memory, as is the case with the Freudian notion of repres- 
sion. More often, however, there is simply general agreement that the spe- 
cific personality mechanisms of an individual will influence things that he 
remembers. 

Individual styles in recoding. Again, we return to Bartlett’s studies 
(1932). One of the most influential aspects of these was his observation 
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that the nature and direction of schematizing, or recoding, is determined by 
the personal characteristics of the individual doing the recalling. The ex- 
tent to which personal characteristics determine the nature of schematiz- 
ing, of course, depends upon the material being recalled. In one of Bartlett s 
studies, he asked subjects to indicate their memories for pictures of faces 
by describing the faces after the subjects had seen them. This task appeared 
to be particularly susceptible to personal attitudes and motivation. A par- 
ticular face would be described in terms of the conventional attitudes to- 
ward the type it represented. Thus, a picture of an enlisted man in the 
army would be described as representing a young, good-humored person, 
etc., and a picture of an officer would be described as stern, elderly, etc. 
Obviously, the attitude aroused by the pictorial schema or the name as- 
signed to the picture by a particular subject influenced his subsequent de- 
scription of the picture. 

We can generalize this kind of observation by saying that the memory 
for a particular event is influenced by the schema the person uses to carry 
it in memory. The schema, in turn, is at the mercy of the individual’s mo- 
tivations and attitudes, some of which are specifically aroused by the origi- 
nal event to be remembered and some by a label or mediating reaction given 
to the event to be remembered. 

Such observations have been extremely important for certain problems 
in social psychology, and indeed, Bartlett’s studies have been as influential 
in this area as they have been in the psychology of learning. They provide 
an important point of contact between the principles of social and of indi- 
vidual behavior. 

Selective forgetting. Retention — or the loss of it, forgetting — seems to 
be selective in nature. That is to say, we remember some things and forget 
others, not because of the inherent associative strength of the material 
learned, but because they are related to our motivations and attitudes. In 
brief, it has commonly been assumed that people remember those things in 
which they are interested. 

One of the best-known studies usually cited to support this general prop- 
osition is by Levine and Murphy (1943). These investigators presented 
subjects with material dealing with the Soviet Union that was either favor- 
able or unfavorable. Before the study, the subjects had been divided into 
two groups on the basis of their attitudes toward communism — procommu- 
nist or anticommunist. Levine and Murphy obtained learning curves and 
retention curves for these two groups. It turned out that there was a marked 
relationship between the attitudes of the subjects and amount learned. Thus, 
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the procommunists were able to reproduce, during learning, a larger per- 
centage of the separate ideas from the procommunistic material than from 
the anticommunistic material. Unfortunately, however, because of the dif- 
ference in learning, the retention curves started at different places, and 
hence it is extremely difficult to tell whether there was selective forgetting 
or not. In other words, there was selective learning, but this tended to 
mask the possibilities of finding selective retention. 

Since selective learning seems to happen so much of the time, it is ex- 
tremely difficult to tell whether or not selective forgetting also occurs. One 
experiment at least suggests that it does. In this experiment (Taft, 1954) 
Negro and white boys were asked to recall material read to them aloud; 
the material contained favorable and unfavorable, ambiguous and neutral 
items about Negroes. On an immediate recall test the white boys recalled 
about as many neutral items as did the Negro boys, but the Negro boys re- 
called more emotionally toned items, both favorable and unfavorable, than 
did the whites. The greater recall for the emotionally toned material on the 
part of the Negro boys was probably influenced in part by the fact that the 
experimenter was white and was an authority figure of considerable im- 
portance. 

Thus far this experiment tells us littie that is new. However, the experi- 
menter went one step further and asked for a delayed recall three days after 
the original recall. On the delayed recall the Negro youths were still su- 
perior to the whites in the recall of fsivorable material, but they showed 
considerably more forgetting of the unfavorable items than did the whites, 
so much so, as a matter of fact, that the whites were slightly superior on the 
recall of unfavorable items. 

Why did the Negro youths forget the unfavorable items? Very likely be- 
cause they fitted less well into their emotionally determined schemata about 
successful Negro baseball players (this is what the material was about). 

In addition, however, there may have been deliberate suppression of the 
material (unwillingness to tell the experimenter about the unfavorable ma- 
terial) or repression. This last mechanism is extremely important, and we 
shall deal with it later. 

Once again we must emphasize the importance of the present assimila- 
tion of what the individual perceives with what he has learned in the past. 
This will determine what an individual learns about a new situation and 
probably what he retains of it. Aside from the emotional and attitudinal 
content of this problem it is not basically different from the general ques- 
tion of transfer of training. A new situation arouses a set of well-established 
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and preferred associations in the individual, and it is to these responses 
that parts of the new situation become selectively attached. When the time 
for recall comes, the individual is able to bring out those parts of the new 
situation that can readily be aroused by strong associations— those things 
that have become assimilated into the psychological life history of the 

individual. 


THE MECHANISMS OF FORGETTING 

We have seen that individuals tend to remember things which are related 
in a certain way to what they have learned in the past. In most of the experi- 
mental literature we have described in support of this general proposition, 
the amount of learning was uncontrolled. Thus we have been dealing with 
both learning and retention. If we have established, however, that some 
responses have all been equaUy well learned, what determines how many, 
and which, of these responses will be forgotten? The answer to this ques- 
tion is the subject of this section. We shall deal with studies in which in- 
vestigators have deliberately tried to produce forgetting and with the 

theories that have arisen out of these experiments. 

An assumption frequently made is that forgetting is simply a matter of 
lack of practice. A learned act, it is argued, disintegrates because time 
passes without any practice, much as a footpath will be overgrown with 
weeds if no one walks along it. This idea is reasonable enough, for in the 
natural course of events, our learning seems to slip away from us with the 
passing years. The schoolteacher who returns to the university for a summer 
session after five years of teaching the sixth grade makes a determined ef- 
fort to impress her French instructor with the fact that she has not looked at 
French in years and consequently will not remember much. 

Arguments against the principle of disuse. Forgetting believed to be a 
result of lack of practice usually is called disuse. There are many grounds for 
dismissing disuse as a serious notion in the psychology of learning. Mc- 
Geoch (1932) pointed out some of these in a classical paper on the nature 
of forgetting. The most important point that he made is that disuse does not 
explain forgetting if disuse only implies the passage of time, for time in 
itself does not cause anything. Events happen in time; certain conditions 
change over time, and it is these that provide explanations. Thus we find 
that forgetting is not determined by passage of time alone, but by the na- 
ture of the events which fill a time interval. An analysis of these events will 
lead to an understanding of the nature of the forgetting process itself. 
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Transfer and Forgetting 

If we once admit the proposition that forgetting is determined by events 
filling a time interval, it immediately becomes apparent that the experi- 
mental designs for the study of transfer of training are also those for the pro- 
duction of forgetting. Negative transfer, then, is admitted to be one of the 
mechanisms of forgetting. 


Retroactive Inhibition and Proactive Inhibition 

Experimental designs. As we saw in the chapter on transfer there are 
two types of basic experimental designs for the study of transfer. The retro- 
action experiment measures the effect of an interpolated activity on the re- 
tention of some previously learned act; the proaction experiment measures 

the effect of an older learned act upon the learning and retention of a new 
act. The basic designs are as follows: 


Proaction 


Experimental group 

Learns task 1 learns task 2 

Control group 


(recalls task 2) 


Yearns task 2 (recalls task 2) 

The mfluence of task 1 upon task 2 can be seen by examining either the 

learning of task 2 or the retention of that task sometime after its original 
learning. 


Retroaction 
Experimental group 

Learns task 1 learns task 2 tested on task 1 

Control group 

Learns task 1 rests tested on task 1 

The influence of task 2 is measured by a retention test (usually recall or 
relearning) of task 1. 

If the experimental condition impairs the retention of an old task (retro- 
action) or the learning or retention of a new task (proaction), negative 
transfer or inhibition has taken place. Hence the special names retroac- 
tive inhibition and proactive inhibition for the negative side of transfer. 
Most investigators have considered retroactive inhibition to be the more 
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important condition for the mechanisms of transfer in forgetting, but more 
recently proactive inhibition has been assigned an important role also. 

In order to see how the conditions of negative transfer have been applied 
to a theoretical analysis of forgetting, it will be necessary for us to examine 
some of the principal experimental findings in negative transfer. In the 
next few pages we shall look at some of the more basic experimental vari- 
ables. 

Negative transfer and stimulus-response relationships. In the chapter 
on transfer we examined a diagram, the transfer surface, which implied 
that the major conditions for negative transfer between two tasks were ( 1 ) 
common stimulus elements between the tasks and (2) different response 
elements. The diagram implied that, holding stimulus similarity between the 
tasks constant, transfer could be varied from positive to negative by chang- 
ing the responses in the two tasks from being identical or very similar, to 
being very different from each other. 

This diagram was drawn by Osgood (1949) to summarize the findings 
of many different experiments. In other words, the basic relationships pre- 
sented in the diagram are based upon empirical research. A large number 
of experiments, using both the proaction and the retroaction design, have 
shown that when stimuli are the same, but responses different, negative 
transfer results (Bruce, 1933; Gibson, 1940; Bugelski, 1942; Underwood, 
1945; Osgood, 1946; Morgan and Underwood, 1950). 

Thus, a safe generalization about paired-associate learning is that when 
the same stimulus items and different response items are used in two tasks 
there will be negative transfer. 

The transfer surface also implies that the direction and amount of transfer 
depend upon the degree of response similarity. As this similarity goes from 
a maximum (identity) to a minimum (as Osgood suggested, opposite 
words), transfer goes from maximum positive to maximum negative trans- 
fer. The empirical evidence, however, does not entirely support this rela- 
tionship between response similarity and transfer. 

Some experiments appear to uphold the notion that transfer varies con- 
tinuously as a function of response similarity (Morgan and Underwood, 
1950). Other experiments cast doubt on the generality of this relationship. 
For example, Bugelski and Cadwallader (1956) performed an experiment 
designed to test all of the implications of the transfer surface. In this ex- 
periment the greatest amount of retroactive inhibition was for responses in 
recall which were somewhat similar to responses in interpolated learning, 
rather than for responses which were neutral or opposite. 
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This is a complicated and difficult issue to settle. It may be partially the 
result of the amount of interpolated learning, as it was in an experiment 
(Deese and Hardman, 1954) cited in the chapter on transfer. On the other 
hand, it may be the result of the way in which the subjects are able to re- 
code responses. For example, if during interpolated learning, the subjects 
become aware that the responses required are all antonyms of the responses 
required in original learning, the net effect, both on interpolated learning 
and recall of the original list, may be positive transfer. At recall the subject 
remembers that one of the words in the interpolated list was “swift” and 
quickly thinks of the antonym “slow.” If, however, the words on the two 
lists are unrelated to each other, no simple recoding will do. Perhaps maxi- 
mum negative transfer would be obtained in situations in which responses 
are simply unrelated or are so similar that the subject cannot remember 
which word belongs to which list. Suppose, for example, that a subject 
remembers that “swift” was a response item for one task and “fast,” for the 
other; since they are similar in meaning, however, he has difficulty in recod- 
ing them to their proper lists. He is likely, therefore, simply to guess one 
or the other, and there is a 50 per cent chance of making a mistake. 

This matter of the relationship between recodings available to the sub- 
ject and response relationships in negative transfer still needs a thorough 
experimental exploration. 

A comparison of proaction and retroaction. Several investigators have 
asked the question, Is retroactive inhibition greater than the proactive inhi- 
bition of retention when the same materials are used to test both? This is 
important, because we wish to know whether these two situations are meas- 
ures of the same psychological effect or whether one involves something 
more than the other. 

One classical experiment on this comparison (Melton and von Lackum, 
1941) used nonsense syllables and the technique of serial anticipation. Un- 
der these conditions retroactive inhibition was greater than proactive inhi- 
bition of retention. That is to say, an interpolated task has more inhibiting 
effect on the retention of an original task than an original task has on the 
retention of a task learned second. This finding will not violate the layman’s 
prejudices, since in the proactive experiment the more recent task is meas- 
ured for retention, and in the retroactive experiment the earlier task is the 
one measured. 

If, however, a time interval is allowed between original learning and the 
test for retention, the difference between proaction and retroaction disap- 
pears (Underwood, 1948). This again, perhaps, will not surprise the lay- 
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man, since the order difference between the tasks in proaction and retroac- 
tion will be slight compared with the time difference of a long retention 
period — if there is a long enough time lapse, the order of the tasks becomes 
unimportant. Thus, while there is a difference between retroactive and pro- 
active inhibition when the retention test is immediate or follows soon after 
original learning, the difference disappears if the retention test is delayed. 
This effect is largely the result of recovery from retroactive inhibition ratlier 
than increase in proactive inhibition (Underwood, 1948 ), though some in- 
crease in proactive inhibition probably does occur after a long retention 
interval (Duncan and Underwood, 1953 ). 

Proaction, retroaction, and decree of learning. If we compare the ef- 
fects of learninu and retention of a second task after differinc amounts of 
practice on the first task, we find that both positive transfer and proactive 
inhibition of retention increase as a function of the degree of prior learning 
(Atwater, 1953). That is to say, the degree to which the acquisition of a 
new task is aided by previous learning depends upon the amount of practice 
on the original task (such aid is probably largely “learning how to learn”), 
but by the same token, the greater the learning of a prior task, the more 
it will interfere with retention of the second task. 

It is not clear, however, in what way amount of practice on the prelimi- 
nary task and on the second task interact. We do know that “overlearning” 
of a single task makes it more resistant to ordinary forgetting. If the pre- 
liminary task is “overlearned,” the responses in it may be so thoroughly 
associated with one another that the learning of a new set of responses 
will not interfere with them. 

In the retroactive design increased practice on an interpolated task has a 
more complicated effect, at least in rote verbal learning. With a fixed num- 
ber of trials on original learning (nonsense syllables), an increase in prac- 
tice on the interpolated task first results in an increase in retroactive inhibi- 
tion. After the interpolated task has been practiced to a much higher level 
than the original task, however, further practice on the interpolated task 
seems to decrease the amount of retroactive inhibition (Melton and Irwin, 
1940; Thune and Underwood, 1943). Thus, it is possible that if the tasks 
are highly differentiated by overlearning either the original or the interpo- 
lated task, retroactive inhibition may be less, at least in an immediate test 
of retention. 

There is still some doubt, however, about how general this finding may 
be. For motor learning, at least one experiment (Lewis, Smith, and Mc- 
Allister, 1952) shows that the amount of retroactive interference is simply 
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inversely related to criterion level on an interpolated task. Another experi- 
ment on rote verbal learning (Briggs, 1957) shows that retroactive inhibi- 
tion increased to an asymptotic limit with an increase in practice of the 
interpolated task, but did not decrease. This, however, may have been 
simply because practice on the interpolated task was not carried far enough. 

In general, we can conclude that inhibition (either proactive or retroac- 
tive) grows with an increasing amount of practice on the interfering task, 
but it is likely that there is a limit to this increase; it may even be that with 

very high amounts of practice on the interfering task, the inhibiting effects 
may decline slightly. 

Theories of Retroactive and Proactive Inhibition 

What factors are responsible for proactive and retroactive inhibition 
effects? If we regard the experiments on these effects as models of the ex- 
perimental production of forgetting, then a reply to this question would 
provide us with some of the factors responsible for the forgetting process. 
The most frequently given answer at the present time is that competition 
between responses is one reason for both proactive and retroactive inhibi- 
tion. Before we examine some of the implications of this notion, however, 
we shall examine one of the other possible theories of inhibition, one which 
has been conceived of as interruption, or disruption, of a memory trace. 

Interruption of a memory trace. The classical theory of retroactive in- 
hibition is usually known as the “perseveration theory,” and the essence of 
this is that after a series of responses has been practiced, the effects per- 
severate for a period of time (presumably in the central nervous system) 
and that any other activity will interfere with these effects. This notion, 

incidentally, has been advanced to account for the distribution of practice 
effect. 

As far as retroactive inhibition is concerned this idea is of limited use. 
For one thing, it implies that retroactive inhibition should occur most 
strongly when a test for retention of interpolated learning is given immedi- 
ately after interpolated learning, and indeed, it should not occur at all if the 
test is delayed too long. This is contrary to fact (Postman and Alper, 1946). 

Also against this notion as a general explanation of retroactive inhibition 
is the fact that it does not predict the rather important effects of similarity 
(both with respect to stimuli and responses) upon retroactive inhibition. 
Furthermore, it does not allow for an explanation of retroactive inhibition 
and retroactive facilitation by the same mechanisms. Finally, it must lead 
one to the conclusion that retroactive and proactive inhibition of retention 
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are essentially different phenomena, since the perseveration notion cannot 
explain the latter. 

These facts have led contemporary theorists to abandon perseveration as a 
general theory for retroactive inhibition. Nevertheless, there may be some 
facts about learning which demand a construct such as perseveration, or at 
any rate, suggest the role of interruption of the memory trace immediately 
after it is formed as a factor in forgetting. For one thing there is the existence 
of the immediate memory span. Why is our memory span for unrelated items 
limited to about seven items? Why cannot we hear a long string of numbers 
and then be able to recite them as well as we can when we split the string 
in two and recite the two halves separately? The fact that we cannot sug- 
gests that the later items we hear somehow interfere with our memory for 
the first items before we get a chance to emit or recode them, 

Hebb (1949) has suggested that there are two phases to the neural trace 

of something learned, dynamic and permanent. The dynamic phase lasts 

but a short time, and, Hebb argues, it depends upon something like active 

recurrent circuits in the nervous system. If anything acts to interfere with 

the event to be remembered before it is transformed into a more or less 
Static memory trace, it is disrupted. 

Such an effect, however, probably has little to do with retroactive inhibi- 
tion as it ordinarily occurs. The “traces” established in a retroactive inhibi- 
tion experiment are durable and exist much longer than the few seconds in 
immediate memory span. Therefore the notions about consolidation or per- 
severation of traces are not useful as explanatory constructs for the retroac- 
tive and proactive inhibition effects. They may be useful in explaining some 
of the fundamental facts of learning, but thus far there has been little inter- 
est in theories of learning built on the notion of consolidation of traces. 

Competition theory. The most specific of the several related concepts 

that have been advanced to account for retroactive and proactive inhibition 

IS the hypothesis of response competition. This asserts that there wUl be 

mutual interference between two sets of responses to the extent that the 

responses from the two sets compete with one another at recall. The extent 

to which one set of responses will compete with another depends upon the 

relative associative strength of the sets of responses to common stimuli and 

the inabJity of the individual to recode the responses in such a way as to 
give them distinctive contexts. 

The effects of competition may be expressed in several ways: (1) The 
individual may be able to recall both sets of responses but may not be able 
to recognize which one is correct. (2) The individual may be able to recall 
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only one of the alternative responses and not the other. Competition here 
may be indicated either by {a) the recall of the incorrect response or {b) 
a longer reaction time for the recall of the correct response. (3) Finally, 
competition may depress all alternative responses, so that for everything 
except probing by a forced-choice method, the individual may not be 
able to recall either set of responses. Indeed both sets may be depressed 
enough for a third “irrelevant” set of responses to occur. 

Competition can occur only in a situation in which one of the alternative 
sets of responses is correct. If either one of the two alternative sets will do 
in response to a particular situation, competition, by definition, does not 
occur. Furthermore, competition will not occur when the individual can 
recall both sets of responses and both are acceptable. Neither of these condi- 
tions usually arises in formal tests of retention. For example, in answer to 

a completion item such as “the receptors for color vision are the 

an instructor would accept only “cones,” not “rods or cones.” 

Implications of the competition hypothesis. The competition hypothe- 
sis, even in the bare outline presented above, has several rather specific 
implications. For one thing it implies that the degree of competition between 
habits will depend primarily upon the associative strength of the habits for 
identical or related stimuli. In other words, the stronger a competing re- 
sponse, the more it will compete. Furthermore, if associative strength be- 
tween the common stimuli and the two sets of responses is equal, then 
temporal order should have no effect. This means, all other things being 
the same, that there should be no difference between retroactive and pro- 
active inhibition of retention. Likewise, there should be no difference in 
amount of retroactive inhibition produced by variation in the temporal 
position of interpolated learning. 

Another set of implications of the competition hypothesis is that both 
proactive and retroactive inhibition should be determined by the similarity 
relationships between tasks. For example, if subjects learn two distinct sets 
of responses to stimuli, competition at recall will be greater if the stimuli with 
which the responses have been associated are identical rather than similar. 
In other words, something like stimulus generalization will operate. 

The competition hypothesis does not have much to say about response 
similarity, however, except that if responses are so imperceptibly different 
that either response will be accepted as correct, then positive rather than 
negative transfer will occur. Once the responses are distinct from one 
another, however, the role of degree of response similarity in the competi- 
tion hypothesis is not clear, since recoding is possible. Perhaps this is just 
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as well, for we have seen that the experimental evidence on negative trans- 
fer and response similarity is not unequivocal. 


On a number of these points the data are not really in disagreement with 
the implications of the competition hypothesis. To be sure, in a test of 
immediate recall there is likely to be a difference between the retroactive 
and the proactive inhibition of retention, but this disappears in a delayed 
recall test. The difference on initial recall may have to do with subtle 
factors of set and warming up. Certainly, in the main, the transfer surface 
IS in accord with the implications of the competition hypothesis. The data 
on the temporal point of interpolation in retroactive inhibition are contra- 
dictory and confused, which is perhaps exactly what one would expect from 
the competition hypothesis. 


Since competition is determined by the associative strength of the com- 
peting activity, we should expect that retention in retroa'ctive inhibition 
would vary inversely with the amount of interpolated learning and that re- 
tention in proactive inhibition would vary inversely with the amount of learn- 
ing on the original task. Again, this appears to be the case, with the possible 
exception of high degrees of interpolated learning in retroactive inhibition. 
The slight drop in retroactive inhibition that occurs in such a situation may 

be the result of differentiation of the two response systems through efficient 
recoding. 

Direct transfer and the competition hypothesis. Perhaps more impor- 
tant than these indirect implications of the competition hypothesis are direct 
indications of overt negative transfer. Direct expression of competition 
can occur in two ways: (1) the reaction time for correct responses may be 
increased by the need to suppress incorrect alternatives at recall, and (2) 
overt intrusions of responses from the competing set of responses may actu- 
ally displace correct responses in recall. 


Both these effects occur. Postman and Kaplan (1947) have observed that 
reaction time during relearning of the original responses is increased as the 
result of interpolated learning. From this increase it is easy to infer that the 
correct responses are inhibited for a time by an occasional tendency to sav 
the appropriate word from interpolated learning. Everyone has had the 

hk something without collecting 

h. thoughts tor a moment, and this generally means picking the correct 
response from a number of alternatives. 

Of critical importance to the competition theory has been the observation 
f direct intrusion of responses from the competing set of habits. Ordinarily 
rote learning experiment, most of the “errors” are simply failures of 
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the subject to respond within the assigned time limit. Sometimes, however, 

subjects will respond with inappropriate responses, and these have provided 

a direct mdex of competition. The very fact that such intrusions occur is 

direct and conclusive evidence that competition is responsible for at least 
part of interference with recall. 

Melton and Irwin (1940) noticed that the frequency of intrusions of 
interpolated responses during the recall of an original task was greatest 
when the amount of practice on the original and interpolated tasks was 
about the same. If there was more practice on the interpolated task, in- 
terpolated responses tended to intrude less frequently during recall of the 



Figure 51. Retroactive inhibition as a function of the number of trials of interpolated learning. 
The dashed lower curve shows the inhibition attributable to overt intrusion at recall. The upper 
dashed curve shows inhibition attributable to factor X. {Melton and Irwin, 1940.) 


original material. This again is very likely the result of efficient recoding 
which leads to clear differentiation of the two sets of responses. Unfor- 
tunately, total retroactive inhibition continued to increase beyond the point 
at which intrusions of interpolated responses began to decrease. This situa- 
tion is illustrated in Figure 51; the curve labeled “RI attributable to factor 
X” is that portion of the total retroactive inhibition that could not reason- 
ably be accounted for by intrusions from interpolated learning interfering 
with the recall of original responses. The existence of this factor led Melton 
to a sweeping revision of the competition theory, and the resulting revision is 
usually called the “two-factor theory” (note: this two-factor theory is not the 

same as the two-factor theory of conditioning we considered in early 
chapters). 
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The two-factor theory. The competition of responses is clearly one 
factor in retroactive inhibition. Perhaps, however, other factors are at work 
in retroactive inhibition, and possibly also in proactive inhibition. One of 
these seems to be specific to retroactive inhibition. As a matter of fact, a 
difference between retroactive and proactive inhibition of retention in im- 
mediate recall has been advanced as one of the reasons why a second factor 
in retroactive inhibition is needed. Melton (Melton and Irwin, 1940; Melton 
and von Lackum, 1941) has suggested that this second factor is the “un- 
learning” of original responses during interpolated learning. 

If interpolated responses intrude at recall of original learning in retroac- 
tive inhibition, it seems reasonable to suppose that there would be also 
intrusions of original responses during interpolated learning. And, indeed, 
there are. When responses from original learning occur during interpolated 
learning they are incorrect, and hence, it may be argued, are weakened. 
Thus interpolated learning has the double function of ( 1 ) building up a set 
of competing associations and (2) weakening the set of original associa- 
tions. In proactive inhibition of retention only competition can work, because 
the competing responses are learned first, and thus the correct responses 
cannot be “unlearned” because of practice on the competing responses. 

It is extremely difficult to obtain any direct evidence for unlearning, be- 
cause we do not fully understand how it works. Many experiments, how- 
ever, give indirect evidence for the existence of unlearning (Melton, cited 
above; Underwood, 1945, 1948). More recently, Briggs (1954, 1957) has 
charted the course of responses appropriate to original learning and those 
appropriate to interpolated learning by means of specially prepared recall 
tests inserted at certain points during learning. These tests show that the 
frequency of original responses declines as interpolated learning is con- 
tinued. In other words original learning is weakened. Furthermore, these 
responses “spontaneously” recover if a time interval is allowed between 
interpolated learning and a test for retention of the original responses. 

This last observation, which has been frequently made (Underwood, 
1948, etc.), has led to the view that the unlearning of original responses 
during interpolated learning is much like the process of extinction. This 
comparison explains nothing, but it reminds us that unlearning, like extinc- 
tion, is in part only temporary in nature. Furthermore, again like extinction, 
just a little relearning can cause all traces of unlearning to disappear. 

Thus, it seems likely that the retroactive inhibition effect has at least two 
causes, ( 1 ) unlearning of the original responses during the learning of the 
interpolated material, and (2) competition between responses at recall. 
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Proactive inhibition involves only the second factor. The relative im- 
portance of the two factors can be assessed by the fact that there is not 
much difference between retroactive and proactive inhibition of retention 
when conditions are otherwise the same and little or no difference when an 
interval of time is allowed to elapse. Thus competition at recall would seem 
to be the overwhelmingly more important of the two factors. 

The generality of competition. Looking back at Figure 47 near the 
beginning of this chapter we see that Ebbinghaus’s retention curve for non- 
sense syUables fell off very rapidly after original learning. Twenty-four 
hours after learning, Ebbinghaus showed a savings of about 35 per cent 
in relearning the same list. The general picture we get from textbooks on 
the psychology of learning is that such a result is fairly typical of the for- 
getting of nonsense syllables. Yet, more than one modern investigator has 
been puzzled to discover, by using college students as subjects (who are 
much less practiced at learning nonsense syllables than Ebbinghaus was), 
that their retention is much better after a comparable time interval. The 
students in the author’s course in experimental psychology, for example, 

typically show 75 to 90 per cent savings after 24 hours. What accounts for 
this difference? 

Earlier we pointed out that there were two basic designs for the study 
of retention. In one, the same subjects learned a different list of nonsense 
syllables for each retention interval; in the other, different subjects learned 
the same list of nonsense syllables. This accounts for the difference between 
investigations like Ebbinghaus’s which show low retention after a time 
interval and other studies which show relatively high retention. 

Underwood (1957) has shown how this difference is of fundamental 
importance for a general interference theory of forgetting. He demonstrates 
that the amount of forgetting of any one set of material is an increasing 
function of the amount of similar material the subjects have learned in the 
past. The curve in Figure 52 shows this relationship. Underwood obtained 
this figure by plotting the data from a large number of investigators; it shows 
very clearly that the per cent frequency of recall is inversely related to the 
number of previous lists the subjects have learned. , 

The importance of this point is twofold. First of all, it clearly shows that 

the ordinary retention curves, in which no experimental attempt is made 

to produce forgetting, are determined by the same variables in the proac- ' 

live and retroactive inhibition experiments. This fact places the competition, / 

or more generally the interference theory of forgetting, on a much firmer 
foothold. 
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The second point is even more important. Underwood reminds us that 
most of the emphasis in the experimental production of forgetting has been 
on the retroactive inhibition design. But these data on the influence of 
previously learned material upon the retention of more recently learned 
tasks suggest that the proactive inhibition of retention is really the more 
important design in the experimental study of forgetting. 

To be sure, if the two-factor theory of retroactive inhibition is correct, 
retroactive inhibition involves a second factor, unlearning, not present in 
proactive inhibition. We have seen, however, that the unlearning factor is 



Figure 52. Recall os a function of the number of previous lists of similar materiel learned by the 
subjects. This curve vras obtained by combining the data of a number of different investigators. The 
authors of the separate studies are indicated by the names beside the circles. (Underwood, 1957.) 

much less important than the competition, or general interference, factor, 
and furthermore it appears to disappear to a considerable extent with rest. 

Therefore, what produces forgetting is, to a considerable extent, simply 
mutual competition among things we have learned. Fortunately, most of us 
are safe from the possibility of a complete collapse of our associative proc- 
esses by the fact that we ( 1 ) are continually relearning the things we need 
in our daily life and (2) we have devised recoding devices for keeping im- 
portant items free from competition. 

Thus I may glance briefly at a set of notes before I go into the classroom 
to deliver a lecture. This serves to reactivate the coding devices by which 
I have stored many chunks of information. Just a word or two will enable 
me to recall a rather complicated theory, and I find as I talk that stories 
(alas, too familiar) and anecdotes flow automatically. 
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Direct competition and indirect interference. The method of paired 
associates provides the ideal model for the competition theory of forgetting. 
In the classical paired-associate experiment the subject learns two sets of 
responses to identical stimuli. When the stimuli are presented at recall 
there is competition between the two sets of responses. In actual practice, 
in an experiment on retroactive inhibition, competition produced by the 
alternate set of responses can account for nearly 70 per cent of the errors 
made by subjects at recall (Deese and Hardman, 1954). 

Unfortunately such direct evidence for competition does not always 
occur m experiments on serial learning by the method of anticipation. If 
subjects are asked to learn two lists of nonsense syllables by the method of 
anticipation, retroactive inhibition occurs when recall of the first list is 
asked for; the second list reduces the tendency to recall the first. Direct 
competition from the first list, however, accounts for a negligible number 
of overt errors at recall (about 5 per cent) when recall for the first list 
comes right after the learning of the second one. What happens in this case 
is that the learning of the second list causes the subjects to confuse the 
proper location of items within the first (Deese and Hardman, 1954). Thus, 
while the second list does produce retroactive inhibition, it does so by pro- 
ducing confusion within the original list rather than by directly competing 
with it. 

Thus direct competition after interpolated learning is disappointingly 
small in serial learning. Apparently, what happens in this case is that the 
rather weak organization within the original list is interfered with. The 
contextual boundaries between lists, probably established by some primitive 
coding, are strong enough so that the two lists do not directly interfere with 
one another. These contextual boundaries are not strong enough, however, 
to remain intact with the passage of time. If time intervals of a few hours 
or a few days are interposed between interpolated learning and recall of 
original learning with serial lists, there is competition at recall from the 
interpolated list (Deese and Marder, 1957). Thus direct competition occurs 
if a time interval elapses. The time interval probably weakens the contexts 
which held the lists apart; we should like to think that this weakening of 
context is also by interference, but we have no direct evidence for this. 

The point to be made is that interference may not always be exhibited 
in the form of direct competition at the time of recall. These are not 
grounds, however, for discarding the notion of competition in such cases. 

We pointed out earlier that competition may depress all “relevant” re- 
sponses so that irrelevant responses can take over. Thus, the addition of a 
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second list to the first does not have the effect of producing competition 
directly, but this addition at fairly low strength (because of the contextual 
boundaries between the lists) does depress the associations from the first 
list sufficiently so that their correct order is not maintained. 

In conclusion we can say that direct competition is an important de- 
terminer of the efficiency of recall. In addition, competition probably affects 
recall indirectly by competing with the correct responses so that other, 
irrelevant responses can displace them. Probably the conditions most 
directly affecting the extent to which competition takes place are similarity 
among the stimulus conditions associated with particular responses and the 
degree to which these responses are assimilated (recoded) into the indi- 
vidual’s past history of learning. 

Repression 

The work of Sigmund Freud has been enormously influential in nearly 
all of the social sciences. The psychology of learning is no exception. The 
concept of anxiety as a drive, the use of the technique of free association 
for probing emotional processes, both have their origin directly or indirectly 
in the theories of Freud. Since Freud touched upon nearly all dynamic 
processes in human behavior, he had something to say about forgetting. In 
this section we shall consider one of the most important of the possible 
forgetting mechanisms suggested by the work of Freud and his followers. 

The Concept of Repression 

Freud (1925) pointed out that one of the most effective methods an indi- 
vidual can use to protect himself from the conflict between powerful in- 
stinctive urges and the restrictive demands of the ego is through the 
repression of these urges and everything connected with them. This means, 
first, that the existence of these urges is not allowed to intrude into the in- 
dividual’s conscious life, and secondly, that all mental events associated with 
these urges are pushed out of consciousness. Thus repression has two 
forms. In the most fundamental sense repression refers to the blocking of 
motivation. In a secondary way, however, it refers to an expulsion of certain 
associations from the conscious life of the individual. Repression requires a 
constant expenditure of psychic energy, since it must hold down a drive, 

which itself is expending psychic energy. Thus repression is an active 
process. 

The fact that repression is active does not mean, however, that the indi 
vidual consciously tries to keep threatening ideas or wishes out of his mental 
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life. On the contrary, such ideas would have to be admitted before they are 
suppressed. Freud is arguing that they are not allowed to come into con- 
sciousness in the first place. Thus, repression is a dynamic conflict at an 
unconscious level. 

For our purposes, one of the really fundamental aspects of the concept 
is that anything that is repressed is not lost. It is only submerged below the 
level at which, under ordinary circumstances, the individual can volun- 
tarily recall it. Furthermore, it does not take relearning to reactivate this 
material. If the source of conflict is removed, the repressed memory will 
recover without specific practice. It is this feature of potential recovery that 
is of critical importance to the role of repression in forgetting. 

Experimental Evidence for Repression 

Repression is a concept designed to explain certain manifestations of 
personality, certain kinds of amnesias, the course of an individual's behavior 
during psychoanalysis, and certain effects under hypnosis. These things, 
however, only suggest what the role of repression is for associative proc- 
esses. Can ordinary associations be repressed? The only way to answer this 
question is to attempt to produce a process similar to repression in the 
laboratory. 

Unfortunately, the experimental evidence on repression is weak. This is 
partly because repression is not a well-defined concept (we may get involved 
in an endless quarrel over the meaning of the “ego,” for example) and 
partly because experiments on the production of emotional reactivity, which 
is necessary to repression, are difficult to perform and interpret. The essen- 
tial requirements for an experimental demonstration of repression are ( 1 ) 
the establishment of equal degrees of learning of some material in a control 
and an experimental group of subjects, (2) the imposition of some ex- 
perience likely to produce repression in the experimental group, (3) indica- 
tion of repression through forgetting, and (4) recovery from repression 
after the threat imposed in (2) is removed. 

Several experiments meet these requirements. For example, Zeller (1950, 
1951) managed to associate the retention of nonsense syllables with the 
threat of personal failure. Under these conditions subjects failed either to 
recall or relearn the syllables as well as a control group of subjects. When 
the threat was removed, recovery of memory occurred. A similar experi- 
ment by Abom (1953), which used somewhat different learning materials 

and the measurement of “incidental” memory, showed essentially the same 
results. 
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These experiments leave many questions unanswered. From the stand- 
point of the classical theory of repression we may wonder about the role of 
consciousness. Presumably the subjects in these experiments were motivated 
to recall the material when under the repression conditions but were simply 
unable to do so. This is probably the only thing we can say, and in this 
sense these experiments fall short of psychoanalytic theory (though per- 
haps the role of consciousness in the theory needs examination). More 
important questions for our purposes concern the nature of the associations 
themselves under repression. What happens to the organization of the ma- 
terial? What is the role of associative strength? Can very strong responses 
or highly coded systems of responses be repressed? Is it possible for us to 
repress, say, the names of our parents or grandparents? These are all ques- 
tions which cannot be answered by the experimental data we have, and 
perhaps they are beyond any of the experimental techniques available to 
us. Nevertheless, such matters need to be settled before the concept of 
repression can take the important role it deserves in the psychology of 
learning. 


In Conclusion 

We have seen that interaction between responses learned at different 
times is an important determiner of forgetting. We may well ask, however, 
whether or not forgetting ever really occurs. If two responses compete at 
one particular time, the stronger one will win out, but in order for competi- 
tion to occur, both must be there. Are responses then ever really elimi- 
nated? There is, of course, no way to answer so general a question. Never- 
theless we can say that the residual effects of something presented in the 
past can be detected after long intervals of time and under very improbable 
circumstances. Burn (1941) describes an experiment with his son, in 
which he read the boy passages, in Greek, from Sophocles’ Oedipus 
Tyrannus. There were three selections of 20 lines each, and for a period of 
time Burtt read these selections to the boy every day. The child was less 
than two years old. Six years later, he was required to learn by rote these 
selections plus some new ones from the same source. It took the boy an 
average of 435 repetitions to learn the new selections and only 317 repeti- 
tions to learn the old ones. Thus, there was savings of better than 25 per 
cent. 

Furthermore, we need to point out that the mechanisms of forgetting 
discussed in this chapter are not necessarily the only ones. Forgetting is a 
complex phenomenon, and there are undoubtedly many subtle features of 
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it that we have not yet begun to explore. We are just beginning to under- 
stand the important role of recoding in memory, both in retaining and 
altering things as they are retained. We need to know much more about how 
competition and interference interact with recoding. It is probable that once 
the material is in recoded form, it is relatively resistant to competition, but 
it may suffer considerably before it is thoroughly recoded. In addition, of 

course, in its recoded form it may be at the mercy of changes in motivation 
and attitude. 

In short, our past experience is being continuously altered by new ex- 
periences and changes in our interests and attitudes. Thus memory is neither 
a static thing nor the gradual decay of what was once learned. Our past is re- 
shaped for us by what we do now as much as what we do now is directed 
by our past. 


CHAPTER 1 1 


PROBLEM SOLVING AND THINKING 


We regard the ability to think as the supreme accomplishment of the evolu- 
tion of mind. That we should feel this way is understandable, for thinking 
is the thing in which we human beings undeniably excel. The very fact 
that we characterize human thinking as a product of the evo lution of mind , 
however, suggests that it has its antecedents in animal mentality. Further- 
more, the actual process of thinking and solving problems is learned, and 
we have already seen that many basic principles of learning are common 
to man and to higher animals. Consequently, thinking and its analogue in 
animal behavior must have an intimate connection with the basic principles 
of learning. In this chapter we shall attempt to outline some of these con- 
nections. 


PROBLEM SOLVING IN ANIMALS 

In the 75 years or so that psychologists have been examining animal 
behavior in the laboratory, an enormous number of examples of the ability 
of animals to solve problems has accumulated. Not all of these will help 
us to understand the nature of problem solving, but a few important ones 
will show how theories of problem solving have arisen. 

A problem exists when an oiganism is motivated to achieve some goal and 
there exists no ready means to reach it. Problem solving is the process by 
which the animal achieves the goal. Since some examples of problem solving 
merely illustrate the basic principles of instrumental conditioning, problem 
solving extends down to the most elementary examples of learned instru- 
mental behavior. Other examples of problem solving are quite complicated, 
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however, and the processes involved are evidently more elaborate than 
those in simple instrumental conditioning. 

Two Classical Viewpoints on Problem Solving 

Thorndike’s Trial-and-error Problem Solving 

The term trial and error applied to the process of problem solving was 
introduced by C. L. Morgan (1894), but its importance comes from some 
well-known experiments by Thorndike (1898). Thorndike reported exten- 
sive observations on the way in which animals, mostly cats, learned to 
escape from puzzle boxes. 

These puzzle boxes were simply enclosures from which the animal could 
learn to escape by turning a door latch, pulling a string, or pressing on a 
treadle attached to the door. Almost every student of psychology knows 
that Thorndike’s cats were very slow and ineffective in solving this problem. 

Like most cats, these evidently did not relish being penned in, and they 
would attempt to escape almost as soon as they were placed in the box. 
These attempts, however, were neither systematic nor fruitful. After vari- 
able amounts of time spent thrashing around, the cats would, quite acci- 
dentally as a rule, happen upon the device for opening the door. Usually, 
as soon as the door was open, the cats could leave. When they were re- 
turned to the box, however, they usually began their futile direct escape 
attempts all over again. It generally took quite a few trials for them to learn 
to turn directly to the device that opened the door, and the learning was 
usually an irregular, hit-or-miss affair. 

These cats did not appear to be particularly intelligent. They showed 
little ability to grasp and retain the principle of the puzzle box after having 
once happened upon it. Thorndike tested a number of other kinds of ani- 
mals, and while, in general, monkeys and dogs did somewhat better than the 
cats, he was not greatly impressed with the need for supposing that any very 
high level of mental activity took place in his animal subjects. 

These experiments on the puzzle box do not differ in principle or much 
in detail from the experiments on the way rats learn to press levers in 
Skinner boxes. As a matter of fact, Thorndike’s experiments led him to 
state the principle of effect as the basic mechanism in learning. The principle 
of effect refers to what we described in earlier chapters as the principle of 
reinforcement m instrumental learning. He pointed out that the essential 
element in the learning of instrumental behavior is the association of some 
response with a reinforcement. Gradually, a response so reinforced gains 
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prepotency over other responses and comes to dominate the behavior of the 
animal in the testing situation. The behavior of animals is blindly de- 
termined by the circumstances of reinforcement. There is little in animal 
behavior to suggest the need for more complicated processes, or at least 
these seemed to be the implications of Thorndike’s work. 

Of course many investigators of animal behavior resented the implica- 
tions of the pure trial-and-error description of animal problem solving. 
Both before and after Thorndike's experiments, numerous investigators pre- 
sented evidence for what they believed to be far more complicated proc- 
esses in animal learning and problem solving. 

The notion that Thorndike was grossly in error and that problem solving 
in animals is a much more complicated and intelligent process than he sup- 
posed finally came to center around the work of W. Kohler on the intelli- 
gence of anthropoid apes. We shall briefly review Kohler’s work and the 
implications that have been drawn from it. 

Kohler's Insightful Problem Solving 

.Kohler (1925) studied in great detail the problem-solving behavior of 
anthropoid apes, particularly chimpanzees. He was critical both of Thorn 
dike’s results and methods, and he designed problems for animals which he 
felt would more adequately probe their mental processes. 

Kohler studied the ability of his animals to solve a wide variety of instru- 
mental problems./ These were always designed, in contrast to Thorndike s. 
so that all the elements necessary for solution could be perceived by the 
animals. There were no trick doors or hidden mechanisms that delivered 
food.jFor example, in some of Kohler’s best-known observations, he simply 
suspended the animal’s daily ration of food from the roof of the animal's 
cage. A box was placed some distance from where the food was suspended. 
The problem for the animal was to perceive the relation between the box 
and the location of the food and to act accordingly. 

In one case, Kohler reports, an animal spent a fruitless day trying to 
scramble up to the food by various means. When Kohler demonstrated the 
solution to the animal, however, by placing the box under the food and 
touching the food whUe standing on the box, the animal immediately got the 
solution. There was no trial and error, no contingency between responses 
and reinforcement. In other examples of the box problem in which the ani- 
mals more or less arrived at their own solutions, Kohler thought that the 

animals showed clear evidence of foresightfully planning behavior in ad^ 
vance. 
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-°d error. Since 

ohler believed that his animals did solve the problems by foresightful plan- 
mg he characterized their solutions as insightful. By this he meant that ani- 
als did not accidentally happen upon a solution; instead, they saw the rela- 
,o„sh,p between ,h. of , ptobl.tn in ,„ch a wa^ that Jy a'nt, 

Thia means in effect that the animals tried out hypotheses based upon 
he pereced relat.onships in the environments SomeLes. of course, 

is^not tS" “ incorrect hypothesis, Kohler argued, 

iwt the same thing as blind, unplanned trial-and-error behavior. 

How do we know when insight occurs? It is necessarily an inference 

Z of P-blem sofution thaMed 

ohler and other psychologists to identify solutions as insightful are many 

trmedTn th' of solution. This is well illus- 

oluhon rff when the 

ari Chari ‘^at trial-and-error solutions 

the solution on the third trial than it did on the second. tL is generally 
tiue for insightful solutions. Once the animal gets the solution, it can 

K^eir H “Pon being exposed to the problem. Finally, 

Kohler claimed, it is possible to infer from the behavior of the animals that 

ey were able to put together objects in the environment that were separate 

and unrelated. These characteristics of insightful solution are illusiated 
in an example which is taken from Yerkes (1943) 

-in obirT'" ""I with 

Tete! I H 1 70 centi- 

aTthe ‘°P °f box 

couM be I V T experimenter, and a banana 

d be locked inside while the ape watched. A pole was also placed in 

he room, the pole was just the length of the box. Thus, the ape could get 

the banana by pushing it out one end of the box with the pole. 

doorllT th P''y °P®" ‘be latch on the 

door in the middle of the box. Gradually it lost interest in this endeavor 

and began to show signs of frustration. It even attempted to induce the 

soZTtTZ Z P°““’ however, the animal 

n?.r H ir' '"ben it began playing with the 

pole and rolling ,t around the room. The pole rolled into posiLn alongside 
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the box. The ape reached into the box with its arm in a vain attempt to 

reach the banana. It then ran to the opposite end of the box, where it again 

looked in. Then, suddenly, the ape turned to the pole and instantly pushed 

it through the box and then ran around to the other side to obtain the 
banana. 

How can this behavior be explained away as an example of instrumental 
conditioning? Indeed, it is difficult to do so. The ape appeared to see very 
readily the relationship between the box and the stick. This, by no stretch 
of the imagination, is similar to the rat's perceiving the relation between 
pressing the lever and obtaining food in the Skinner box. 

It is another matter, however, to say that one therefore needs an entirely 
new principle of behavior to account for the observations on insightful 
problem solving. Such a need diminishes further when it becomes apparent 
that there is a more or less continuous gradation between trial-and-error 
solution and solution by insight. Furthermore, some observations indicate 
that a background of simple instrumental learning is necessary for the oc- 
currence of insight. In other words, insight depends upon and grows out of 
instrumental learning and trial-and-error behavior. 

Bridging the Gap between Insight and Trial and Error 

The background of insight. Some investigators, not merely content with 
characterizing problem solving in the higher animals as sometimes insight- 
ful, have tried to find the elements in the background of experience neces- 
sary for the occurrence of insight. 

For example, Birch (1945) presented some chimpanzees with the prob- 
lem illustrated in Figure 53. As we can see from this drawing, the food was 
placed within the blade of a hoe, and all that was required of the animal 
was that it rake the food in. Four out of six animals failed to solve the prob- 
lem within the 30 minutes allowed. One animal succeeded after it acci- 
dentally moved the food into the cage. This, of course, is pure trial and 
error. Another animal succeeded in what appeared to the investigator as 
an insightful solution. Thus, despite the simplicity of the problem and the 
general intellectual superiority of chimpanzees in the animal world, insight- 
ful solutions were not plentiful in this example. 

Birch went one step further, however. He provided his subjects with 
sticks to play with. In the course of the stick play, the animals behaved in 
such a way that Birch inferred that they were using the sticks as functional 
extensions of their arms. Such learning is probably weU described by the 
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model of instrumental conditioning. The important point, however, is that 
after a certain amount of play with the sticks, these chimpanzees could 
solve quite complicated problems, including the hoe problem, which de- 
manded the use of sticks of various sorts. 

The implications of this study are clear. Insight depends upon a back- 
ground of simple instrumental habits. If the animal has not learned these, 
the probability of insight is extremely low; indeed, solutions to problems 
are by trial and error. In many instances the occurrence of insight is simply 
by generalization of lower-order habits. Thus, in some cases at least, in- 
sightful problem solving is simply the result of positive transfer. 

The extension of trial and error. What about Thorndike’s cats? Are 



Figure 53. The hoe problem. The chimpanzee can rake in the food, which is out of arm's reach, 
simply by pulling in the hoe. (After Birch, 1945.) 

cats exceptionally stupid animals, or can they too solve problems by in- 
sight? This is not a very elegant way of putting the scientific problem, but the 
fact is that other experimenters have noted considerably more intelligent 
behavior in cats than did Thorndike. Even in the same situation, the trial- 
and-error behavior noticed by Thorndike was probably due to inexperience. 
Adams (1929) repeated Thorndike’s experiments and came to the conclu- 
sion that Thorndike’s cats, in many instances, were simply too excited and 
untamed to demonstrate anything more than random thrashing about. 

Insightful and purposeful solution of problems is not the exclusive prop- 
erty of primates. Indeed, the lowly laboratory rat can solve problems in- 
sightfully if conditions are right (Maier, 1929). Furthermore, pure trial- 
and-error behavior is not unknown among adult human beings. Despite 
the fact that people use the manipulatory symbolic function of language, 
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they still indulge in blind trial-and-error behavior when faced with an un- 
usual and difficult problem (Ruger, 1910). 

Bridging the gap. Harlow (1951) presents cogent arguments, some of 
which are like those presented above, to explain why it is not necessary to 
postulate two basic and distinct processes in problem solving. Trial-and- 
error behavior is simply the establishment of elementary habits in the ab- 
sence of stimulus generalization, secondary reinforcement, etc., from 
previous learning. Such is the case in most experimental studies of animal 
behavior. The laboratory rat, for example, never in its life encounters any- 
thing like the lever in a Skinner box until it is the subject of an experiment 
on learning. Thorndike’s cats had never before been confined in boxes from 
which it was possible to escape by pulling on strings. Therefore, the correct 
responses had to be built up from scratch. 

In other situations unfortunately all too rare in the psychological labora- 
tory, an organized sequence of training converts a naive animal into a rather 
sophisticated one. In many different ways we have had occasion to refer to 
Harlow’s experiments in which he changed monkeys who made choices on 
successive discrimination problems in a blind and mechanical fashion to 
monkeys who carefully formed and acted upon hypotheses. These monkeys 
behaved insightfully, and their behavior came entirely from a rigorous pro- 
gram of training. Their hypotheses were the result of “second-order” habits 
of observing and testing consequences, and they arose because these are the 
common features of behavior reinforced in a number of lower-order habits. 

Does this mean that problem solving is simply transfer from more ele- 
mentary habits? It is unlikely, since if this were the case, the only thing to 
prevent a rat from behaving as intelligently as one of the great apes would be 
the number and kind of elementary habits it could bring to bear on a given 
situation. There are genuine species differences in a number of important 
psychological dimensions, and while we do not know all of them, some in- 
vestigators have isolated a few that must be very important. Among these 
are the differences between animals in the capacity to react to stimuli in 
their absence and to react differently to the second occurrence of a stimulus 
than to the first. These indicate the ability of animals to react symbolicaUy 

and we shall examine next some problems in investigating symbolic be- 
navior in animals. 


Symbolic Behavior 

It is convenient to divide symbolic behavior into three classes— ( 1 ) sim- 
ple discriminations or classical conditioned responses, (2) reactions to 
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stimuli which are not in the environment but exist within the organism or 
are mediated by learned cues, and (3) linguistic reactions in which the or- 
ganisms recode portions of the environment for action in the absence of 
the particular cues. It hardly need be said that, so far as we know, this third 
variety of symbolic behavior is limited to human beings. 

Simple discriminations serve to lead the organism from place to place. 
The formation of symbolic representations in this sense is nothing more than 
learning in its most basic aspect. The second function of symbolic behavior 
is to bridge a gap in the absence of external stimulation. This we can call 
representational symbolic behavior. The higher animals evidently can pro- 
duce their own symbols, so that they can respond appropriately even after 
the external stimulus has been removed. The third function, the linguistic 
function, enables organisms to manipulate the environment symbolically. 

It is the second function which is of basic importance to the comparative 
study of problem solving. If animals solve problems by testing hypotheses, 
it is essential that they be able to represent to themselves more than the 
environment present at the moment. To the extent that there are differences 
among species in this ability, there should also be differences in ability to 
solve problems insightfully. Consequently, many investigators have exam- 
ined animals at different positions in the phylogenetic scale for their ability 

to react to stimuli in their absence. Some evidence on this question is pre- 
sented below. 

The Delayed-reaction Experiments 

W. S. Hunter was one of those who realized the importance of examining 
symbolic action in the problem-solving behavior of animals. Hunter (1924) 
devised and examined several tests which he thought would force animals 
to react to stimuli in such a way that the only inference one could reason- 
ably make was that symbolic representation had occurred. Some investiga- 
tors have disagreed with Hunter in this matter, so it will be necessary for 
us to examine these tests in some detail. The first is the delayed-reaction test. 

There are two basic methods in this test, direct and indirect. In the in- 
direct method the investigator trains the animal to associate a certain stim- 
ulus with a reinforcement. For example, Hunter presented animals with 
three doorways, one of which was lighted. If the animal chose this one, it 
was reinforced. When Hunter was certain that this association had been es- 
tablished, he would turn the light on, let the animal observe the correct 
door, and then turn the light off while restraining the animal by means of 
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a glass barrier. The animal would be restrained for a period of time and 

then released. If the an.mal chose the correct door more frequently than 

ance, Hunter thought there was evidence that it could keep a symbolic 
representation of the light. ^ 

In the direct method, pretrain.ng is not necessary. Generally the direct 

method works with primates and higher animals. For example, a monkey 

may be presented with two cups. It watches the experimenter place a raisin 

under one of the cups. The monkey is restrained for a period of time and 
then allowed to choose one of the two cups. 

Hunter tested a variety of animals by the indirect method and found that 
there were marked differences from species to species of mammals. For ex- 
ample. in his original experiments (1912), he found that rats could delay 
a maiumum of about 10 seconds, while dogs could delay up to 5 minutes 
systematic ordering of the animals Hunter tested indicated that he had 

.hrc::'r,r' 

Not everyone was convinced that the delayed-reaction test permitted an 
nequivocal inference about symbolic representation. Even in Hunter’s 
bservations, there were some indications that animals were using con- 
tinuing external stimulation to bridge the gap. In other words, they were 
mg substitutes for direct symbolic representation. For example, Hunter 

entaT maintained the bodily ori- 

rectL?n^i h did not need to maintain the cor- 

orientation in order to respond. Thus, it seemed likely that the raccoons 

were more capable of a true symbolic representation than wl the "aTs 

der other conditions, however, rats can delay up to four minutes 

(’^"Cord, 1939; MacCorquo- 
, 7). Here, ammals were prevented from responding to spatial cues 

w en the correct stimulus was moved to another door. This procedure for 
preventing response to spatial cues removes certain objections that 
psychologists have had to using the delayed response experiment as an i^h 

vie. de.ayed.„.=„„„ eape,L„„ o’, '» 

examples of complex discrimination learning (Seward 1948) Ineith 

■he solnrion .pp problem represents a level proce ii , he " '“T 

and »_e ,ha, is proh.h,, closely Imhed with .he di»e,e„,i„ ahilhy' IZt 
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The Double-alternation Problem 

Another test that has been generally considered to be something that 
could be solved only by symbolic representation is the double-alternation 
problem. In the double-alternation maze problem (Hunter, 1928) an ani- 
mal is required to learn a maze consisting of two alternative pathways. 
Each pathway describes a square and brings the animal back to the starting 
point. Figure 54 shows an example of such a temporal maze. The animal 
must learn to alternate the pathways in a sequence. In the double-alterna- 



Figure 54. The floor plan of a temporol maze. In the double-alternation problem the animal is 
required to run around twice in one direction and th^n twice in the other direction before being 
rewarded. 


tion problem the animal is required to go twice to one side and then twice 
to the other side in order to receive the reward. The sequence can be ex- 
tended, so that instead of a series such as RRLL before reward, the animal 
must learn a sequence such as RRLLRRLL. 

The rat finds it all but impossible to solve even the simplest sequence 
(RRLL). Raccoons (Hunter, 1928), cats (Karn, 1938), and other animals 
can solve the double-alternation problem. The errors, incidentally, in solving 
such a problem are distributed among the alternatives in much the same 
way they are in the linear-maze problem (Stewart and Warren, 1957). 
Preverbal children find the double-alternation problem difficult, but chil- 
dren who can verbalize solve it easily (Hunter and Bardett, 1948). 
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It IS difficult to see how any animal could solve the double-alternation 
problem as it is seen in the temporal maze, without some central mechanism 
that can teU the animal where it is in the sequence. The central mechanism 
needed is of a fairly low order, considered from the human point of view, 
since It involves, at the most, the ability to count to two. The double-altema- 
non problem, however, seems to be a more clear-cut case of symbolic be- 
havior m the absence of secondary supporting cues than does the delayed- 
reaction problem, since it is almost impossible to find grounds for external 
cues that could help an animal solve it. As Hunter points out, even kines- 
thetic cues must be ruled out in the temporal maze.' 


The Role of Symbolic Representation 

There is something more to insightful problem solving than simply the 
cumulation of transfer. Some animals, high on the phylogenetic scale, per- 
orm in such a way that it is almost impossible to avoid the inference that 
they are capable of symbolic representation. External stimuli become re- 
coded in some way so that these animals can react to cues in their absence 

or react in such a way that they can keep separate the first and second 
occurrences of a particular external cue. 

Symbolic representation can have many different functions in problem 
so vmg. For example, it probably plays an important role in multiple-choice 
problems which require an organism to remember and reject the alternatives 
already tried and found wanting (see Miller and Frick, 1949). Ifmust be 

among external events. 

the example of insightful problem solving described a few pages back 
symbolic representation of the pole while the animal was peering into the 

problem^°’' '""'ghtful solution to the 

Unfortunately, the implications of some of the basic properties of sym- 
1 C representation have never been fully explored. Physiological psychL 
gists who are interested in investigating some of the localized functions of 
he central nervous system have made extensive use of the tests we have 
mentioned. Few investigators, however, have tried to follow some of the 
implications of these tests for problem solving generally. All we can do at 

amoL"*®^ themselves and to the differential ability 

animals for problem solving. ^ 

We know that there are marked species differences in the ability to trans- 

li double alternation of lever pressing which thp rat 

Uneslhenc cues could conceivably work (Schlosberg and Ka,r!943) 
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fer complex discrimination problems (Harlow, 1949; Cottennan, Meyer, 
and Wickens, 1956, etc.). These probably also reflect underlying abilities to 
use symbolic representation. It is intriguing, though perhaps fruitless, to 
speculate on the nature of the differences in representational factors among 
different animals. Some investigators (Stewart and Warren, 1957) have 
suggested that in many tests of symbolic representation, the differences are 
quantitative rather than qualitative. In the normal adult human being, 
though, there are clear qualitative differences. 

HUMAN PROBLEM SOLVING 

We are sure that normal adult human beings can solve problems sym- 
bolically, because they possess highly developed and organized linguistic 
tools for doing so. Therefore, there is no question whatever about the nor- 
mal adult’s ability to solve problems insightfully. He may not always do so 
although he is capable, and, indeed, a knowledge of this capability is what 
leads us to suspect an analogous process among animals based upon a 
much more rudimentary symbolic process. 

For many years, Piaget and his^associates have been studying the devel- 
opment of intellectual processes in children. These studies fill out in rich 
detail the growth of the processes of insight and concept formation in devel- 
oping children. A great deal has been learned about children’s concepts of 
number, space, time, and other fundamental matters from these studies. In 
addition, these investigators have studied the actual process of problem 
solving. For example, Inhelder and Piaget (1955) point out that young 
children solve problems by finding similarities and differences among 
things; their approach to problems is concrete. Adolescents, on the other 
hand, may develop formal propositions; they try to formulate or find ab- 
stract laws for solving whole classes of problems irrespective of their spc 
cific content. Although it is on a different level and involves quite different 
processes, this description of progression from concrete to abstract proV- 
lem solving is highly reminiscent of the formation of learning sets in Har- 
low’s monkeys. 

An account of such factors, however, is more properly the subject mat- 
ter of developmental psychology. Our task at the moment is to find out what 
we can about the relationship between associative factors, learning, and the 
solution of problems. Therefore, we shall turn to the experimental literature 
aimed at the analysis of problem solving and the relationship between this 
process and mechanisms of learning. 
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Studies of Human Problem Solving 


The Formation of Hypotheses 

Hypotheses are not random. When a normal adult is faced with a prob- 
lem to solve, his attempts at solution are not random samples of the pos- 
sible behavior in which he might engage. For that matter, the cat in the 
puzzle box does not really behave randomly either. Certain classes of re- 
sponses have preference over others for both man and cat. The behavior 
of the cat is more likely to be characterized as trial-and-error behavior 
simply because, in all likelihood, it will be less systematic and more random 
than the ideal strategy for solving a particular problem. As we shall see 
shortly, men do not always or, indeed, frequently employ an ideal strategy, 
but their behavior is more likely to be systematic than the behavior of the 

cat or monkey. In this sense, man is less likely to engage in pure trial-and- 
error behavior than are any animals. 

There are many ways in which the behavior of men and that of animals 
differ qualitatively during problem solving. An animal, for example, is 
much more likely than the normal, unruffled adult to repeat over and over 
again a response which has already been tried and found not to lead to solu- 
tion. The range of possible attempts to solve a particular problem will, in 
general, be less in animals than in man. Finally, man, by his ability to use 
language, can much more effectively recode the problem into more familiar 

terms or he can exhaust systematically the possibilities for finding a solu- 
tion. 

Thus, the hypotheses men use in solving problems are not random selec- 
tions of the ones they might use. They tend to be determined by rather sys- 
tematic factors. Cofer (1954) points out that some of the most important 
determining factors in human problem solving are mediating verbal re- 
sponses.2 These are responses that the individual is not likely to make 
overtly; they are implicit verbal responses of the sort we usually mean 
when we talk about thinking. The important verbal mediating responses 
which lead to attempted solutions of a problem may not be directly elicited 
by the problem situation but may be only indirectly elicited by virtue of 
association with responses directly produced by the situation. In the next 

few pages we shall examine some of the ways in which verbal mediatine 
processes influence problem solving. ^ 


Also see Osgood (1953) for a detailed account of the theoretical 
iional processes in learning and thought. 


role of media- 
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Verbal mediating responses and direction. The way in which verbal 
mediating responses can give direction to hypotheses in problem solving is 
illustrated in an experiment by Judson and Gofer (1956). In this study, 
subjects were given groups of four words and told to indicate the one word 
not related to the other three. Suppose, for example, that add, subtract, 
MULTIPLY, PER CENT Were given. This item is easy. It is obvious that per 
CENT is the word that does not belong, since it is the one word that does 
not indicate an arithmetic operation. Suppose, however, that the words 
were add, subtract, multiply, increase. Here the answer is not so easy. 
To be sure, the same three arithmetic operations are present and increase 
is not an arithmetic operation. But add, multiply, increase all belong 
to a class of words indicating increasing magnitude, while subtract 
belongs to a class of words indicating decreasing magnitude. Perhaps sub- 
tract is the word that does not belong. 

Judson and Gofer administered many such ambiguous items to subjects. 
The object was to find out which ambiguous word the subjects would accept 
as belonging with the other words. One of the things they discovered was that 
the order of the words was important. Thus subtract, increase, multiply, 
add would more likely lead to increase being rejected, whereas multiply, 
increase, add, subtract would more likely lead to subtract as the re- 
jected word. 

Secondly, subjects were likely to accept ambiguous words that con- 
formed to their previous habits. For example, in problems like prayer, 
temple, cathedral, skyscraper, subjects with strong religious interests 
were more likely to accept the word prayer and reject skyscraper than 
subjects with little religious interest. Thus it seemed clear to Judson and 
Gofer that the direction chosen by the subject was the result of activation 
of mediating responses which recoded the material in a particular way. The 
probability of a particular mediation response is determined in turn by its 
relative associative strength. This point will become quite clear in later 
examples. 

Fixation in Problem Solving 

Fixation by mediating responses of function. If a particular problem 
elicits a strong set of mediating responses which point to a particular solu- 
tion, and if that solution is incorrect, we have a case of fixation without ade- 
quate solution of the problem. The subject becomes fixated on the wrong 
response and caimot switch to the right one. Frequently problems are pre- 
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sented to us in such a way that they do elicit the wrong mediating resp>onses, 
and great difficulty with the problem results. 

This point is illustrated by some of Duncker’s (1945) w-ell-known ob- 
servations on problem solving. Duncker gave students a variety of simple 
problems to solve. One of them was as follows: The subjects were required 
to mount three small candles on a door (ostensibly for experiments on 
vision). The materials the subjects needed for solution were scattered in 
confusion on a table and mixed in with a number of other objects. The 
crucial items were tacks and some cardboard boxes similar to small match 
boxes. The solution required the subjects to tack the boxes to the door, and 
then to melt wax on the boxes in order to hold the candles. 

This problem was presented to the subjects in two ways. The difference 
was small but of critical importance. In one condition the boxes were filled 
with the experimental material — tacks were in one box, candles in another, 
and the matches in the third. In the other condition the tacks, candles, and 
matches were placed loose on the table and the boxes were empty. 

What difference did these conditions make? Those subjects who were 
presented the problem with the boxes empty all solved it, while only about 
half the subjects were successful when the boxes were filled. What was 
responsible for this difference? It is very easy to understand if we consider 
the possible mediating responses that these two versions of the problem 
elicited. When the boxes were filled, they were more likely to elicit a medi- 
ating response such as “container.” When the boxes were empty, the mediat- 
ing response “container” was likely to occur also, but not nearly so strongly. 
The result was that other mediating responses, among them perhaps “plat- 
form,” were able to occur. The filled boxes thus produced a fixated function 
which interfered with the proper solution of the problem. 

Duncker studied a number of different problems like the one we have 
just examined. His experiments were carried out with a good deal of in- 
genuity, but, unfortunately, they leave much to be desired, since the number 
of subjects was very small and the experiments were performed under 
rather informal conditions. Other experimenters who have worked with 
these or similar problems have achieved much the same results, however 
so this functional fixation, as Duncker called it, seems to be a genuine 
effect (see Adamson, 1952). 

One important point comes out of subsequent experimental work. If time 
is allowed between the exposure which sets off incorrect fixation and the 
actual work on the problem, the fixation disappears (Adamson and Taylor, 
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1954). Thus, it is clear that whatever kind of verbal mediating habits are 

responsible for the fixation, they behave much as other kinds of verbal 
habits. 

Other experimental work has indicated the effect of verbal responses 
suggested to the subject upon the direction problem solving takes (Maier, 
1930, 1931). In one experiment, merely teaching the subjects lists of 
words which could arouse associations pertinent to solution increased the 
frequency of relevant solutions (Judson, Gofer, and Gelfand, 1956). Gofer 
points out that such activation of appropriate solutions probably occurs 
through mediated generalization (Gofer and Foley, 1942). For example, 
if the word “rope” is associated with “swing,” subjects are more likely to 
solve a problem which involves getting two strings tied together by swing- 
ing one to the other by means of a pendulum bob. If, on the other hand, 
“rope” is associated with “hemp” and “pendulum” with “clock,” the pen- 
dulum swinging solution is less likely to occur. 

Riddles, puzzles, and similar brain teasers are based on the probability 
that a particular way of stating a problem wUI elicit a very strong associa- 
tion that prevents the appearance of a mediating response leading to the 
correct solution. This same sort of thing often makes students complain 
about trick questions on examinations. 

Fixation by method. In the examples we have discussed thus far, direc- 
tion of solution has been seen to be determined by verbal mediating proc- 
esses either set off by the problem situation itself or suggested by verbal 
associations given to the subject. Fixation, however, can occur by other 
means as well. Probably one of the most important is the pattern of rein- 
forcement for particular kinds of solutions. This kind of fixation is illus- 
trated by some experiments by Luchins (1942) on a well-known kind of 
problem. 

Luchins asked a group of subjects to solve a number of problems all of 
the same type. Here is an example: Given a 3-quart jar, a 21-quart jar, 
and a 127-quart jar, how would one measure out exactly 100 quarts? The 
solution is to subtract 21 quarts from the 127 quarts, which makes 106 
quarts, and then to subtract 3 quarts twice. The first problems given to the 
subjects always required the use of all three measures for an efficient solu- 
tion. After the subjects had solved six such problems, others were intro- 
duced which could be more efficiently solved with only two of the measures, 
though all three measures could be used. Not surprisingly, perhaps, the sub- 
jects kept on working through these simpler problems by the more compli- 
cated method. Thus the habit built up by the earlier problems persisted. 
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even though it no longer provided the best method. When the subjects were 

instructed to write “don’t be blind” on their papers immediately after the 

sixth problem, a much smaller number persisted in the more difficult 
method. 

Such persistence of habits no longer most efficient for solving particular 
problems has been characterized as rigidity, and a number of investigators 
have experimental evidence to indicate that this kind of rigidity is modified 
by the scheduling of problems and hence of the habits that are reinforced. 
For example, on a conceptual card-sorting task, Schroder and Rotter 
(1952) demonstrated that a training series requiring frequent shifts in set 
resulted in much greater flexibility than one that required no shifts. 

Maltzman and Morrissett (1952; 1953) performed a series of experi- 
ments in which they produced various fixations on the solution of anagram 
(scrambled words) problems. They pointed out that the effects produced 
by fixation are easily interpreted as the result of simple habits. As a matter 
of fact, they were able to derive and verify certain predictions about fixa- 
tion from the assumption that the strength of the habits underlying fixation 
is a simple negatively accelerated (exponential) function of the number 
of previous elicitations of that habit. Remember from the discussion in 
Chapter 2 that such a simple negatively accelerated curve is the most basic 
and elementary kind of learning curve. Thus, there is every reason to be- 
lieve that the habits responsible for fixation are determined by all the basic 
principles of learning. 

Hypotheses in human problem solving are determined by the arousal 

of verbal mediating responses, and these are aroused by and arouse in turn 

simple associations. These associations may or may not lead to the correct 

solution of the problem, and to a certain extent the individual will try to 

direct or restrict associations to areas which have some probability of lead- 
ing to a solution. 

The habits that are aroused by the verbal mediating processes vary in 
strength. The stronger habits are those that have occurred more frequently 
m the individual’s past history, and these have a greater probability of oc- 
curring in any problem situation than weaker habits. Furthermore, they 
depend upon the stimuli with which they have been associated in the past. 
For example, a pendulum bob has a very low probability of eliciting re- 
sponses appropriate to a hammer (using the bob to drive a nail into the 
wall). Yet, when the appropriate conditions occur, subjects can vary their 

mediated responses to the bob in such a way that it occurs to them to use 
it as a hammer (Duncker, 1945). 
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When a secondary or indirect use of a familiar object occurs, it is prob- 
ably because of either direct primary-stimulus generalization (which would 
be greater, say, if the pendulum bob looked like a hammer head or be- 
cause of mediated generalization. Mediated generalization occurs along 
some learned, or conditioned, aspect of stimuli and is characteristic of the 
relationships between words in a language (Cofer and Foley, 1942). Thus, 
the word “vane” might give rise to its homonym “vain,” which in turn might 
arouse “fruitless” and this could lead to “barren” and then to “dull” or 
“stupid.” By an elaborate intertwining of learned and unlearned generaliza- 
tion between words (considering words as stimuli rather than items of be- 
havior), a great flexibility is added to human behavior. This must be of 
fundamental importance in human problem solving. 

Furthermore, many factors which are not primarily associative determine 
the likelihood of particular solutions or outcomes in problem solving. These 
are so-called dynamic factors in behavior, such as motivation, attitude, and 
personality structure. There is a fairly large literature on the relationship 
between these factors and problem solving. For example, Thistlethwaite 
(1950) was able to demonstrate that students with particular biases con- 
cerning racial issues make more errors drawing inferences from statements 
with racial themes than from statements with neutral themes. The greatest 
number of errors in drawing such inferences does not necessarily occur 
with emotional items that are ambiguous, but rather with those in which 
the individual can see a possible answer compatible with his views. 

Since such attitudes and, to a certain extent, motives are determined by 
associative factors, simple associative factors have great importance in de- 
termining the nature and direction of the attempts people make to solve 
problems. 

Other Aspects of Problem Solving 

We have had nothing to say about two very fundamental aspects of prob- 
lem solving. What is the nature of understanding? How does an individual 
recognize and pick out the correct solution from a set of many available 
possibilities? The discussion of these questions, while basic to an under- 
standing of human thought and problem solving, would carry us away from 
the central issues in the psychology of learning. We mention them at this 
point to make clear that we do not solve all the problems of thought by 
discussing the habit structure of verbal mediating responses. Individual dif- 
ferences in ability to solve problems and to think creatively cannot be ade- 
quately described in terms of simple habits, but an understanding of the 
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role of habit, transfer, and generalization in problem solving gives us a 
grasp of certain fundamentals of the process. 

The Learning of Concepts 

The world presented through our senses is a vast jumbled confusion of 
different sensations. We are able to deal with it only by cutting it down to 
the size of our own mental processes. The primary way we do this is by 
setting up equivalences and identities among separate parts of our expe- 
rience. In short, we categorize and assign names to the categories. Thus, 
the deep maroon color of the book on my desk and the tomato-colored 
stripes in my tie both are characterized as “red.” This process of categoriz- 
ing so that all the infinite variety of the external world may be dealt with 
by our mental processes and language, is one of the most essential elements 
in human thinking. We call this process concept attainment, or concept for- 
mation. In the next few pages we shall examine some of the experimental 
literature on concept attainment and see how special problems in the or- 
ganization of concepts are related to the basic principles of learning. 

Forming Concepts 

Categorization. Most of us are so familiar with the conceptual cate- 
gorization of our world that we do not stop to think about the process of 
arriving at categories. We may even naively believe that our conceptual 
categories have independent existence. The habit of accepting the reality of 
conceptual categories may be so firmly established that it will interfere with 
scientific progress and thinking. Only in the past few years have systematic 
biologists generally come to the view that “species” of animals are not fixed 
classes into which animals do or do not belong, but rather that “species” 
are simply convenient classes into which we can group animals with similar 
characteristics. Nature is often continuous and will not fit the boundaries 
of our categories. Thus, frequently there is a continuous gradation in essen- 
tial characteristics from one species of animals to another. 

Every college student at one time or another has run into a homely ex- 
ample of the conflict between categories and the continuity of things. Regis- 
trars and deans demand that instructors categorize students in classes usu- 
ally labeled A, B, C, D, and F. There may be only the faintest difference 
between the student with the highest B and the lowest A, but because hu- 
man society and thinking cannot function with continuously graded events, 
and must resort to categorization, we are required to place these students 
in different classes. 
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We form classes by isolating attributes of things which are to be classed as 

identical or equivalent. There are two ways of doing this. We may take a 

heterogeneous collection of things and examine them for common attributes 

so that we can sort them into classes, or we may come to examine a number 

of things with some preconceived notion about the classes into which they 

ought to be placed. In either case, we assign names to the classes (thus 
recoding our experience). 

Our categorization is usually done in accordance with social reinforce- 
ment. Thus, children learn at an early age to discriminate between dogs, 
cats, and rabbits, though they may make mistakes during the early stage of 
learning to assign the proper names to the individual animals. 

Learning conceptual categories. Osgood (1953) reminds us that the 
formation of concepts involves verbal mediating responses which permit 
us to recode the continuity of our experience. The classical experimental 
literature on the formation of concepts has dealt with the ways in which 
such verbal mediating responses are established. 

For example, Hull (1920) studied the way in which students learned to 
find identity relationships between stimuli possessing common elements. 
He presented subjects with Chinese characters paired in a certain way with 
English words. Chinese characters are compounded of certain elements, 
called radicals. These may vary in position or size within the character, but 
whenever they are present it means that the character has something in 
common with other characters which also contain the same radical. They 
are roughly analogous to certain syllables (mainly suffixes) in English. 
Thus, the words “repeat” and “return” have in common a syllable (actually, 
a bounded morpheme) which carries the connotation of coming back over 
the same event. In Hull’s experiment, whenever a certain radical occurred it 
was always combined with a certain English word. The problem for the sub- 
jects was to recognize and associate the radical with the appropriate Eng- 
lish word. 

As might be supposed, such learning is much the same as any other kind 
of paired-associate learning. There were, however, some cases in which 
subjects obviously had learned to associate the proper English word with 
the proper radical, but these subjects could not recognize the radical. Thus 
they were reacting to something they could not verbalize. They had learned 
the correct pairing of the Chinese characters with English words, but they 
had not acquired a mediated recognition of this. 

Other experimenters (Heidbreder, 1924; Smoke, 1932) have studied 
the role of hypothesis formation in the discovery of concepts. Smoke (1932) 
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reports that his subjects systematically tested and rejected a number of 

hypotheses— thus making use of transfer— until they finally came upon 

one which gave them enough correct responses to be satisfying. Heidbreder 

(1924) emphasized the fact that the learner does not always have to be 

actively engaged m hypothesis formation in order to arrive at a solution 

of the problem. In especially difficult problems some subjects engaged in 

what Heidbreder called “spectator” behavior; these subjects would resign 

themselves to responses which were more or less random and would spend 

their time in passive observation of the material presented to them. Under 

these conditions, subjects eventually solved the problems. Thus a period 

of quiescent observation may be just as valuable as a period of active 
hypothesis formation. 


The attainment of specific concepts. Despite the fact that concept at- 
tainment does not differ qualitatively from other kinds of learning there are 
differences in the ease with which various concepts are learned. 

Heidbreder (1946a, 1946b, 1947) had subjects learn to attach labels to 
the concepts illustrated in Figure 55. From the pictures we can see that 
some of the concepts were highly concrete (faces, buildings, etc.), and 
others were unfamiliar and abstract (“nonsense” forms, numbers, etc.). 
Heidbreder found that there was a regular order in the attainment of con- 
cepts. The concrete-object concepts were the easiest to learn, the nonsense 
form concepts of intermediate difficulty, and the abstract number concepts 
most difficult. Heidbreder attributed this to a special factor she called the 
hierarchy of dominance, which is due to the objectlike character cl per- 
ception. Thus, the concept of “face” is easy to attain, since the instances 
o It are perceptual entities — in a word, faces. Numbers are difficult to 
attain because the instances of them are embodied in concrete objects 
rather than being concrete objects themselves. Thus, the concept of “two” 
might be exemplified by two spoons or two rabbits. 


A subsequent series of experiments (Dattman and Israel, 1951) did not 

confirm Heidbreder because different ways of embodying concepts were 

used. When appropriate instances of numbers or abstract figures are used 

t e order of attainment of concepts disappears. This finding makes it quitj 

evident that the ease of attainment of concepts most certainly depends upon 

e stimuli and the relationships between stimuli used to exemplify ffie 
concepts. ^ ^ 

with to describe the differences in ease 

tul Hei't T'"'’ A fundamental 

Hei.breders. An experiment by Baum (1954) clearly shows that the 
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Figure 55. Pictures used in Heidbreder's experiments on concept formation. Each illustrates a 
concept which is nomed by a nonsense word. The concept MANKS (sixes), for example. Is illustroted 
by the six stars or the six wavy lines. {Heidbreder, J946b.) 


relative ease with which concepts like Heidbreder’s are attained depends 
upon how much interference there is from stimulus generalization produced 
by similarity of items belonging to different concepts. Baum was able to 
account for all of the differences in ease of attainment of concepts found by 
Heidbreder by the amount of confusion between guesses. For example, sub- 
jects in Baum’s experiment had greatest difficulty attaining the concept of 
“sixness.” They would frequently give the response appropriate to “three- 
ness” when faced with the concept of “sixness.” Thus, there was generaliza- 
tion between the stimuli appropriate to these concepts. We can see, from the 
illustration showing Heidbreder’s stimuli, that six could be easily confused 
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with three, since frequently the stimulus embodying the concept of “six- 

r' ihe subjects 

found the concept of ‘faceness” quite easy; they would only confuse it with 

sfmeTr^ generahzation (or, more accurately in 

some mstances, mediated generalization) produces confusion between the 

examples themselves, which leads to slower learning of some concepts 

gam, as we might expect, since concept attainment seems to obey all 

c rules for other kinds of learning, the rate of attainment of a concept is 

determined by discnminability among the instances of different concepts 

I more°d ffi 

(as Heidb^^ r P^rccptu.l thing-quality about it 

(as He dbrcder seems to imply), but rather because an abstract concept is 

genera ly embodied in specific instances that are less easy to discriminate 

from those of other concepts. Thus a child has less difficulty learning the 

oncept of “tree” than that of “weed,” because trees are more disc "^bl 
from other growing things. oiscriminaDie 

clearly made a difference between Baum’s reasons for the variations in 

ease of attainment of concepts and Heidbreder’s, for nearly always con 

cepts which are easily seen as intact perceptual objects are easier to dis 

crimmate than concepts which are not concrete things. Indeed, it may be 

that Baum and He.dbreder are putting forward much the same idea BaL’s 

ersion seems to be the more general one, however, and it relies on ideas 

erived from more elementary problems in learning, whereas Heidbreder’s 

on IS a specific hypothesis advanced to account only for the data on con- 
cept attainment experiments. 

Positive and negative instances. In learning ahnnt 
pro6, ,„o t.„ds o, ' 7 T 

are of the concept and include the essential chi 

.l.e examples are instance, of what thrconcep H 

ntote 0, the necessatj characteristic, ZT a ph^ an IT 1 " 

parttcular disease „ a high fe.er, spots, and anTrTI “Za TeZ,' 

If, however, only the high fever and erratir r^.ic^ Present. 

spots, it is not an example of this particular Lease.'' 

In some early experiments by Smoke f 10^91 tKc i. 

negative examples did not materially contribute to 

less information in negative than in positive examples, is ^0^ 


296 


THE PSYCHOLOGY OF LEARNING 

whether the differences obtained by Smoke are because of the smaller 
amount of information or because people just have difficulty in assimilating 
negative examples. Therefore, Hovland and Weiss (1953) performed a 
series of experiments in which the amount of information in positive and 
negative instances was controlled. It turned out that subjects do better with 
positive examples even under these conditions, though some subjects could 
learn entirely on the basis of negative examples. 

There are many reasons why positive rather than negative examples lead 
more readily to concept attainment. For one thing, the required characteris- 
tics are directly presented, whereas with negative examples, if the subject 

is to learn what the concept is, he must put together all of the things a con- 
cept is not. 

In everyday life, concepts may be almost impossible to establish bv nega- 
tive instances. This is because the ways in which things differ are so enor- 
mous that a single negative instance scarcely reduces uncertainty at all. Try 
to describe a simple object such as a book by mentioning all of the things it 
is not. It is possible in the experiments by Hovland and Weiss in which 
negative instances were designed to tell as much as positive instances, that 
the subjects did not do as well with the negative ones simply because they 
were not used to dealing with concepts in this manner. A machine could 
be constructed which would solve the Hovland and Weiss problems just as 
easily either way. It is clear, however, that human beings have great diffi- 
culty with negative examples even when the information presented is con- 
siderable ^he fact is, though, that when forced to it people can arrive at 
concepts through negative instances. 

Furthermore, as Bruner, Goodnow, and Austin (1956) point out, many 
concepts must be arrived at largely through negative instances. This is 
characteristic of the attainment of disjunctive concepts. Disjunctive con- 
cepts are those in which one or another attribute may be used to identify 
the concept. These authors give as an example the difficulty in determining 
substances responsible for an allergic reaction in an individual. A food 
allergist must start by finding out what his patient is not allergic to and 
from there try to arrive at a class of allergy-producing substances by many 
negative examples and perhaps a few positive ones. 

Strategies in Concept Formation 

People do not arrive at hypotheses in problem solving by random selec- 
tion. Rather they employ strategies, though they need not necessarily be 
aware of committing themselves to particular plans. Strategies are arrived 
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at by deciding which class of responses one wishes to try in solving a prob- 
lem or attaining a concept and by deciding in what order these responses 
are to occur. 

For example, I forget my keys one morning, and I must borrow a set 
from the janitor to get into my office. The keys he gives me are all alike so 
I must decide how to find the one that fits. One strategy would be to ’try 
the keys in order. Another would be to examine the keys for a clue, such as 
code numbers. If I am harried and out of sorts, I may simply jam the keys 
mto the lock more or less at random. The last is a strategy too. The prob- 
lem of strategy is an important one, because we know that there must be 
a best approach for every problem. Some investigators have designed prob- 
lems m such a way that they know the best strategy in advance, and these 

experimenters have tried to find out if subjects use the best strategy, or if 
not, which one they do use. 

Bruner, Goodnow, and Austin (1956) have made use of some special 
techniques for studying strategies in concept attainment. In a typical experi- 
ment, a subject would be presented with a set of cards. These cards varied 
m four ways: (1) they had different figures on them; (2) they were of dif- 
ferent colors; (3) there were different numbers of figures on different 
cards; (4) they had different numbers of borders around them. Each of 
these “attributes” had three different values; thus there were three kinds of 

figures, crosses, circles, and squares. AU 81 possible cards in this set are 
Illustrated in Figure 56. 


The experimenter would form a concept built on selected “values” from 
these attributes. For example, the concept might be green circles with two 
borders. Thus all cards with both green circles and two borders (conjunc- 
tive concept) would be correct. The subject’s task was to find out what 
he concept was by choosing a card and asking the experimenter whether 

his was an example of the concept or not. What strategies do people fol- 
low in choosing the cards? ^ ^ 

Kmds of strategies. First of all, there is a logically best strategy. In the 
problem we outlined there are 255 possible concepts. A first positive card 

^ positive and 

track of 7 7T ^ keep 

rack of all of the possibilities logically eliminated and solve the problem 

Iver thT’*" h ^ how- 

, that such a strategy places an almost impossible burden on the human 

memoi^ and the capacity for assimilating information already ob^red 

Thus It is very unlikely that anyone would use this completely logical ap- 



Figure 56. The Instances used by Bruner^ Goodnow^ and Austin to illustrate attributes. The forms 


J, 


varied in (1) shape, (2) color, and (3) number of borders. There were three values for each ottribute 
(e.g., color was either green, red, or black). (Bruner, Goodnow, and Austin. 1956.) 
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proach. Some subjects, however, do make a reasonable approximation to 
the logical strategy.^ 

Another strategy the investigators called conservative focusing, and it 
consists of choosing the test cards so that a change in just one attribute is 
made after the subject happens onto a positive instance. For example, sup- 
pose the subject chooses a card with two green circles and two borders. 
He might then decide to concentrate on the attribute of shape, and for his 
next card choose another shape to test (for example, two green crosses with 
two borders). Or finally, the subject might take a gamble and change two 
or more attributes at the same time. This the investigators call focus gam- 
bling; it is gambling because, while the subject may solve the problem in 
fewer trials than with conservative focusing, he may also take longer. 

The use of strategies. In general, as we might suppose, subjects do 
better with conservative focusing than with logical strategy (or an approach 
to it). The superiority of the focusing strategy becomes greater when the 
strain placed on memory and assimilation becomes greater. People who try 
to approximate the logical strategy do more poorly as conditions become 
more difficult. If the subjects are limited in the number of choices they are 
permitted to make (imposed by telling the subject that he can have only 
four or perhaps even one choice to try to solve the problem), the subjects 
tend to shift over to focus gambling. Finally, subjects change their strategy 
as they move from problem to problem. Subjects who have had many posi- 
tive examples in earlier problems gradually shift to a focus gambling strat- 
egy, while those subjects who have had many negative examples shift more 
toward the conservative, one-change strategy. Thus, concept attainment 

strategies are altered by the subject’s subjective estimates of changes in the 
probability of payoff. 

Thus, even such an extended and general characteristic of behavior as 
the use of strategy in problem solving and concept attainment is determined 
by the pattern of previous reinforcement. Also individual habits, less di 
reeled by payoff than by the verbal mediational activity characteristic to 
one’s life and work, influence strategy. Bruner, Goodnow, and Austin point 
out that a few of their subjects of very superior intelligence and usually with 
some mathematical training tried to attain the concepts by a purely logical 
strategy. This was an error in the sense that it failed to meet the objectives 
Of a workable strategy. Thus the mediational processes that directed these 


and "I," 0951) shows that the same limitations of assimilation 
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people to attempt a purely logical strategy failed to produce adaptive be- 
havior. 


Learning and Thinking 

In conclusion, we can point out that the intimate relationship between 
learning and thinking or problem solving does not mean that efficient think- 
ing is entirely determined by associative mechanisms. It is, however, cer- 
tainly determined to a considerable extent by associative learning, and 
furthermore the limitations of associative learning provide very real and 
important limits to our ability to solve problems. We cannot behave like 
logical computing machines, because we do not have the ability to scan 
our memories almost instantaneously and pick out and act upon the appro- 
priate stored information. We can carry in our heads a limited amount of 
information, and we are dependent upon external cues and the situations 
of the moment for our ability to extract and act upon any piece of informa- ^ 
tion in our memory. Furthermore, from the logical point of view our storage 
systems are haphazard and inefficient. We recode things in a hit-or-miss fash- 
ion; we may rely on a nonsense rhyme, for instance, to help us remember 
the order of the cranial nerves for an examination in comparative anatomy. 

We trade order and magnitude for flexibility, however. A computing ^ 
machine can operate on only one set of instructions. It is saddled with the 
strategy set into it by its operators. Its human operators, on the other hand, 
can move here and there. They can make guesses and govern the reason- 
ableness of their guesses by an intuitive and inefficient, but effective, calcu- 
lation of subjective probabilities of payoff. These guesses and the strategies 
for dealing with them are learned, and the ability of human beings to make 
effective guesses in working out problems is determined by the pattern of 
past associative learning. 

At the same time we are at the mercy of our prejudices and beliefs. A 
large experimental literature is devoted to the demonstration that critical 
thinking and logical reasoning among individuals often turns out to be a 
reflection of prejudice and earnestness of belief. 


These dynamic factors, as we have seen, are an important part of asso- 
ciative learning. Motivation influences behavior, and it is behavior, overt or 
implicit, that we learn. Therefore, we learn what our motives dictate that we 
learn. In part, this means that our thinking processes are indirectly de- 
termined by the influence of dynamic factors on learning, but it also means 
that these processes are directly determined by the same factors that de- 
termine learning and memory. 



CHAPTER 12 


THEORY AND 

APPLICATIONS OF LEARNING 


Although we have examined many specific theoretical problems in the 
psychology of learning, we have not yet taken the opportunity to look at 
some of the more general theoretical questions. The purpose of this chapter 
is to survey some of these broader aspects of learning. 

In the course of examining particular topics such as partial reinforce- 
ment, we have seen some of the hypotheses that have come out of various 
theories of learning. In this chapter, rather than concentrate on these 
limited hypotheses, we shall look at some of the contrasting strategies of 
theories of learning and how these came to be. In addition, we shall want 
to consider the relationships of the study of learning to other biolog- 
ical and social disciplines. For example, the neurophysiological basis of 
learning is one of the most fundamental problems in physiological psychol- 
ogy, and we could not consider our survey of learning complete without at 
least a glance at current neurophysiological theorizing. Another important 
question concerns the relations between animal behavior in nature and the 
facts and theories of learning. We should be neglecting something funda- 
mental if we did not briefly survey some of the current work on this topic. 
On the other side of psychology, we have yet to consider the psychology 
of learning ir relation to personality theory, social behavior, and the educa- 
tive process. These are topics we shall need to cover. 

This chapter, then, will include a summary of the theoretical background 
of the psychology of learning and of the relationships between the study of 
learning and other biological and social disciplines. In the first section we 
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shall consider learning theory and after that, turn to a survey of the broader 
implications of the psychology of learning. 

LEARNING THEORY 

Psychology generally, and particularly the psychology of learning, has 
ong been noted for the exotic variety of its theories and the bitter argu- 
ments of their exponents. For several reasons, however, we shall not ex- 
amine each of the classical and current theories of learning. First of all, 
t IS task has already been done in a very complete way by HUgard (1956), 
and secondly, the grand, universal theoretical controversies are somewhat 
out of date. Not too many years ago, a psychologist interested in learning 
was 1 entified by his adherence to one current theory or another, and these 
theories were “opposed” to one another from the ground up. Theoreticians 
(and experimentalists) spent much time in fruitless argument with one 
another, either by damning the logic of an opposing theory or by attempting 
to demolish it through the performance of “crucial” experiments. 

Such theoretical argument has not entirely disappeared today for the 
good reason that hardly any of the basic issues have been settled to every- 
one’s satisfaction. The strong emphasis upon large scafe theories of be- 
avior and devoted adherence to one theory or another have disappeared. ^ 
Currently, psychologists in learning are more likely to quarrel about meth- 
ods of approach and about areas fruitful for research than about universal 
theories. Since it is recognized that there is no theory of learning currently 
available which is able to “explain’’ all the facts of learning, there is little 
talk about the exclusive and ultimate correctness of one or another theory. -f 
One of the things likely to arouse an argument among psychologists is 
the question of the relative merits of trying to derive from rigorously quan- 
titative deductions limited hypotheses about highly specialized problems or 
trying to sketch in a rather intuitive fashion a qualitative description of 
fundamental problems and those of general interest. Thus, one group of 
psychologists may be interested in rigorously deriving a mathematical theory 
of the effects of partial reinforcement on the behavior of the rat in the 
runway, and another group may be interested in trying to find the main 
elements in the relationship between associative learning and problem 
solving in intelligent human adults. There is really little to argue about be- ' ■ 

tween such groups, except the relative merits of the problems they have i 

chosen to work on, since the problems themselves will only at some future e 

date come to a common ground. 
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In this section we shall examine a few of the approaches that are currently 
important in the theory of learning. This will not be an exhaustive account, 
since we have space only to discuss a limited number of problems and 
methods that are of interest to theorists at present. 


The Quantitative Approach 


The Hullian School 


In the background of the main currents of attempts to derive theoretical 

principles of learning by mathematical techniques stands the work of the 

late C. L. Hull. His chief efforts in the theory of learning are only in a very 

primitive sense mathematical, but he, more than any other worker, alerted 

psychologists to the possibilities of applying mathematics to quantitative 
problems in learning. 

Hulls approach. Hulls method^ was to make a number of postulates 
and see what theorems could be derived from combinations of these. Postu- 
lates are fundamental statements not directly testable by experimental 
means. Theorems, which are derived by rigorous logic from the postulates, 
are experimentally testable, however. Thus, one of Hull’s postulates con- 
cerns a stimulus trace, or afterdischarge, as the result of stimulation. Such 
a trace cannot be directly measured. Its properties in conjunction with other 
postulated properties lead to predictions about the form of the relationship 

between ease of conditioning and time intervals between stimuli. This rela- 
tionship should be testable. 

Though this approach provided the basic framework for Hull’s work, in 
actual practice his theorizing fell considerably short of his aims (Koch, 
1954). Very often, he slipped into attempts to demonstrate the adequacy 
of his postulates by citing experimental evidence to show how “reasonable” 
they were. His deductions were sometimes contradictory and in logical 
error. Furthermore, his theory seemed more quantitative than it really was. 
This statement deserves some further comment. 

Hull’s quantitative theorizing consisted mostly of putting his basic postu- 
lates in the form of mathematical equations. He did little in the way of 
deriving theorems by mathematical means from these postulates. Ordinarily 
It IS the mathematical derivation of theorems from postulates that people 
mean when they talk about mathematical theories. HuU simply states an 
equation and lets it go at that. Since there are almost limitless possibilities 


I The best statement of Hull’s beliefs about the method 
m his Principles of Behavior (1943). 


and philosophy of science is 
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for the choice of equations for each situation, his efforts in this direction 
appear arbitrary to say the least. 

An even greater difficulty with Hull’s quantitative theorizing is the fact 
that his basic equations seem to lead nowhere. Often, psychologists state 
relationships in a quasi-mathematical form. Thus we may have heard “ten- 
sion is directly proportional to the amount of conflict.” Such a statement 
implies that the relationship between tension and conflict can be written 

\{ T = kC 

Now there is nothing wrong with this statement and the equation except 

that we have no way to measure either tension or conflict in such a way that 

we can confidently test an apparent linear relationship between them. There 

IS just no way to measure quantitatively either amount of tension or amount 
or conflict. 


This inabiJity to measure many of the concepts Hull talked about was 
part of his difficulty. His most important hypothesis about learning is that 
of habit strength, which is an intervening variable not directly measurable. 
In order to make a quantitative statement about habit strength that can be 
tested experimentally, it must be related to direct measures of behavior 
such as the amplitude of a galvanic skin response. Since there are an enor- 
mous number of measures of behavior that can be used to evaluate learning, 

/ there must be an equally large number of arbitrary “postulates” needed to 
relate habit strength to behavior. Hull made valiant efforts to solve this 
problem (Hull, et al., 1947), but his attempts were so conspicuously un- 
successful that they reflected on the adequacy of his whole theory. 

Hull’s method provided two important lessons for subsequent quantita- 
^ tive theorists. ( 1 ) Given the present confusion over basic problems of 
measurement and fact in psychology, a rigorous postulational approach can 
work only by making each theory cover a very limited area. A compre- 
hensive deductive theory of behavior is practically impossible at present. 
(2) The mathematical relationships in the theory must be developed from a 
very few definitions and postulates. The proliferation of arbitrary mathe- 
matical equations simply leads to confusion, both in the tlieory and in the 
experimental test of the theory. 


Hulls contribution. Although we have been critical of Hull’s method, 
it should not be assumed that all his work was futile. In the first place, he 
did introduce learning psychologists to quantitative theory. Secondly, and 
more importantly, Hull fixed for many years the basic problems and con- 
cepts of learning theory. Hull made a fruitful union of mathematics, classical 
association theory (see page 309), and motivation theory — and out of this 
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came our basic contemporary concepts of drive, stimulus generalization, 
secondary drive, etc. Thus, to a considerable extent we owe the basic con- 
ceptual framework of the psychology of learning to Hull. Workers in mo- 
tivation theory, m the physiology of motivation and learning, and in per- 
sonality theory — all use concepts which were either introduced or made 
popular by him. 

Not the least of Hull’s contributions was in forcing other theorists to 
come to grips with him. The long-standing controversy between Hull and 
his associates and Tolman and his associates led up many a blind alley, but 
out of it came important knowledge. This controversy contributed to the 
clearer understanding of the separation of motivational and learning vari- 
ables and to the understanding of the role of reinforcement in controlling 

behavior. Therefore, in a very real sense, Hull stands at the origin of much 
of modern behavior theory. 

Work in the tradition of Hull is very much alive. K. W. Spence has at- 
tempted to correct many basic defects in Hull’s theory. He has severely 
limited the applications of his fundamental postulates to very special prob- 
lems in behavior. He has, at the same time, attempted to simplify Hull’s 
basic theory and make it more amenable to ordinary mathematical treat- 
ment. In a recent book (Spence, 1956) he has presented his own theory ot 

learning, and this probably stands as the final and culminating work of the 
Hullian school. 

Statistical Learning Theory 

The behavior of animals and men is variable. Learning curves are irregu- 
lar. Most earlier learning theorists regarded this variability as a nuisance 
which could be minimized by careful experimental control and by studying 
behavior only in the most circumscribed situations. There exists, however, 
a mathematics of variability, or randomness, and it was inevitable that 
sooner or later psychologists would apply some of the mathematics of ran- 
domness to the study of behavior. Hence, the statistical theories of learning 

The basic notion behind any statistical theory is that the events one 
measures m the laboratory are the result of the influence of a whole popu- 
lation of elementary random events. These are events which cannot be pre- 
dicted; a good example is the unbiased toss of a coin. If we toss a penny 
enough times it will come out heads about 50 per cent of the time and taik 
about 50 per cent of the time. We can think of the percentage of heads a^ 
result measured m the laboratory and each individual toss of the coin 
as one of the elementary events that goes to make up the result. In statistical 


306 


THE PSYCHOLOGY OF LEARNING 

learning theory, postulates are stated about the distribution of elementary 

random events, and measurable results are predicted from a mathematical 
development of these postulates. 

In recent years a number of theorists have developed statistical learning 
theories. The theories of Bush and Mosteller (1955), Estes and Burke 
(1953), and Restle (1955) are all rather similar to one another in method 
and outcome. Estes and Burke (1953) have called theirs a theory of “stim- 
ulus variability.” The essential notion they use to derive the basic curves for 
classical and instrumental conditioning is that each time a response occurs, 

It becomes associated with a sample of all of the possible stimuli that are 
likely to impinge on the organism on successive trials. We can think of the 
sample as a random sample of stimulus elements that manage to get through 
the sensory input of the organism at the time a particular response occurs. 

Notice that m this theory, learning is an all-or-nothing affair. The sample 
of stimulus elements that occurs simultaneously with any given response 
becomes conditioned to that response. The reason, according to the theoiy, 
that learning curves in practice are gradual and irregular is that different 
samples occur on successive trials. These samples will overlap to some ex- 
tent, so after an initial trial some of the stimulus elements will already be 
conditioned. Others will not, however, and these become conditioned on 
that trial. We can see that if the population of stimulus elements were a 
limited one, eventually so many elements would have been conditioned that 
there would be none left to condition. Thus, conditioning approaches a 

limit of 100 per cent. An intuitive idea of how the cumulation of learning 
occurs can be derived from Figure 57. 

The strategy of statistical learning theories. Statistical learning theorists 
have, to a considerable extent, avoided the problems which plagued HuU. 
Their postulates are for the most part elegantly simple. Bush and Mosteller 
(1955), for example, are able to state an equation which implies that the 
change in probability of a response will be determined by two parameters, 
one associated with reward and the other with punishment. The rest of the 
mathematical development of their theory is given over to the applications 
of their basic equation and the definition of events, such as trials, ^nd 
measurements, such as latency. 

Furthermore, statistical learning theorists are not bothered by the cumber- 
some necessity of transforming equations which relate independent variables 
to intervening variables into equations which relate independent variables 
to dependent variables. The equations are aU stated in such a form that 
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there is only one fundamental measure of behavior, the probability of occur- 
rence of a particular response. All other measures, such as magnitude of 
response and latency of response, are derived from definitions of probabU- 
ity of response. Tlierefore, the statistical learning theories have the consid- 
erable merit of suggesting a fundamental measurement in psychology, the 
measurement of probability or, empirically, the relative frequency of occur- 
rence. This has found widespread acceptance among theoretical psycholo- 
gists, for a variety of efforts in psychology have recently converged on the 
common notion that the basic measurement of behavior must be in terms of 
probability of occurrence of different responses. 


POPULATION OF STIMULUS ELEMENTS 



Figure 57. An illustration of two successive trials according to statistical learning theory. Each dot 

represents one of the "elements" of the learned act. The dots enclosed by trial 1 are present and 

hence conditioned on the first trial. The dots enclosed by trial 2 are present on the second trial. 

Notice that some of these dots are already conditioned, so that the second trial adds fewer ele- 
ments to the strength of conditioning. 


The limitations of statistical learning theory. The elegance and sophisU- 
cation of statistical learning theories should not blind us to certain defects. 
One concerns the kinds of behavior the proponents of these theories have 
chosen to study. To a considerable extent, these theorists have gone over the 
same ground covered earlier by the Hullian school. This has meant that 
they have not discovered much that is exciting or new. 

Strangely enough, where these theorists have ventured into new fields, 
there has been considerable dissatisfaction with the results. One field that 
has been invaded to quite an extent by the statistical learning theorists is 
the study of “guessing behavior” in human beings. The experiments on such 
behavior are familiar to us, since we reviewed them in the section on partial 
reinforcement. It is the fact that these problems grew up as analogies to the 
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effects of partial reinforcement in animal behavior that has led to the attempt 
to deal with them by statistical learning theories. 

In the guessing experiments, subjects face two lights mounted on a board. 
The subjects are to guess while one light is on whether the other one will 
come on also. The experiment is arranged so that this will occur a certain 
number of times, though the individual occurrences are nearly random. In 
this experiment, subjects are likely to guess, after much practice, about the 
same percentage as the actual occurrences of the light. This effect can be 
derived from statistical learning theory. Unfortunately, however, this same 
effect can be derived by a number of other more complicated, but perhaps 
more reasonable, theories. Furthermore, the effect does not always occur. 

It is not the rational strategy, and statistical learning theory has a most un- 
convincing way of handling the rational strategy. 

Some experiments have been designed specifically to test the implications 
of statistical learning theory for guessing behavior (Estes and Straughan, 
1954). These experiments are very limited in scope, however, and the test 
of the theory consists of curve fitting based on empirical constants, a tech- 
nique rather more likely to lead to success than failure. 

These guessing-game experiments are likely to strike the outsider as 
rather sterile and without significance to the basic problems of human be- 
havior. As models of human decision making they have a certain merit, but 
the statistical learning theories so patently ignore some basic facts of human 
behavior (the ability of human beings to remember several guesses previous 
to the one they have just made, for example) that the use of these experi- 
ments as tests of the theory seem to lead nowhere. The theory and the ex- 
periments are, therefore, much more convincingly applied to the behavior 
of rats. 

A somewhat different approach gives promise of some merit. By using 
many of the same notions as those used by the statistical learning theorists, 
Miller and McGill (1952) have been able to construct a small theoretical 
model of free recall of verbal items. This theory assumes that the probability 
of recalling a word on any trial is completely determined by the number of 
times the word has been recalled on previous trials. Such an assumption 
appears to be quite reasonable as a first approximation to a quantitative 
theory of verbal recall. Miller and McGill examine the implications of sev- 
eral levels of complexity of the theoretical model, and again by curve fitting, 
the theory seems to fit some data (it should be mentioned, however, that in 
this case the theory leads to a rather novel way of treating and presenting 
data). Unfortunately, this theory must introduce some assumptions that 
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are patently contrary to fact. For example, it must assume that the different 
words used all have independent probability of being recalled. 

In general, both the weakness and strength of statistical learning theory 
hes in the introduction of oversimplification. It is a strength because it sug- 
gests fundamental variables which need clear and explicit treatment before 
we can go on to the more complicated problems, and it is a weakness be- 
cause the behavior predicted by the theory is limited to special and exotic 
cases only remotely related to the behavior o.f real men and animals. 

Two General Approaches to the Psychology of Learning 

^ssoc/at/o/1 Theory 

If the human being is the featherless biped, he is even more character- 
istically the verbal animal. The major effort in trying to understand the 
nature of learning in man has been directed toward verbal learning. Exoeri- 
mental study of verbal learning has been enormously influenced by the tradi- 
lon of associationism, and in order to understand some of the current pre- 
occupations of psychologists interested in verbal learning, we need to look 
briefly at classical associationism and how it has influenced current work 
Actually, associationism provides the deepest current in the psychology 
of learning, and nearly all theorizing has either been directly in the associa- 

n^rttolr^^'^' h" theory has its beginnings 

m Aristotle and it runs very strongly through the history of British philoso- 

P y rom the seventeenth to the nineteenth century. The doctrine of the 

association of ideas” was designed to provide a description of the conti- 

ity of mental life. Ideas provide the units, or elements, of mental life 

and diey follow one another in a constant chain. One particular idea follows 

another one because, sometime in the past, these ideas have occurred to- 

frasth^g. *™ilar or con- 

Classical association psychology has disappeared, but the idea of the 

Crete association of elements of behavior by contiguity and by similarity 

s unuiu, generalization) provides some of the fundamental IrnZl of 
the modern psychology of learning. * 

Relationships between learning, motivation, and incentive conditions in 
odem psychology have much of their origin in the amalgamation of asso 
ationism and hedonism put forward by the British philosophers. Locke for 

impreKbr^^rfh deepest and most lasting 

pression are those accompanied by pleasure and the relief of pain. Thus 
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he says, “the great business of the senses being to make us take notice of 
what hurts or advantages the body, it is wisely ordered by nature that pain 
should accompany the reception of several ideas.” ^ 

We have seen in earlier chapters that Ebbinghaus successfully translated 
the doctrine of association of ideas into experimental work on the higher 
mental processes. Since Ebbinghaus’s time there has been a continuous 
effort to expand and elaborate the experimental analysis of the associative 
processes. From Robinson (1932) to McGeoch (1942) to Underwood 
( 1957), the experimental work on association has contributed to the scien- 
tific understanding of human nature. 

Why association theory? Why has so much attention been paid to asso- 
ciation theory by psychologists interested in verbal learning? Perhaps the 
most important reasons why associationism is an ancestor of so much of the 
contemporary psychology of learning while other philosophic doctrines 
have disappeared are ( 1 ) its emphasis upon cause and effect and the chain- 
ing of ideas and (2) its analytic nature. Since ideas and the words that 
accompany them can be separated one from another and because they can 
be combined in every imaginable way, it is possible to treat arrangements 
of them as independent variables. Independent changing of the controlling 
variables is the cornerstone of the experimental method, and classical asso- 
ciation theory provided a framework for the experimental investigation of 
verbal behavior. 

Thus the hallmark of associationism in the modern study of verbal learn- 
ing is the independent variation of the stimuli or words presented to the 
learner. We put nonsense syllables, etc., together in different ways and see 
what effect the different arrangements have upon the behavior of the learner. 

In so doing, we find justification for emphasis upon contiguity, repetition 
(the “law” of association added by Hartley), and similarity in determining 
how behavior will organize itself. With the additional consideration of fac- 
tors of motivation and incentive, a complete program for the experimental 
investigation of verbal learning emerges. 

The limitations of the associationistic approach. The classical associa- 
tionistic approach to the study of verbal learning and memory is exemplified 
by the methods of rote serial learning. These methods are designed to exert 
the greatest control over the learner; they force him to associate items 
according to the design of the experimenter, not according to his own 
wishes. The result is that the typical experiment on the verbal associative 

2 John Locke, “Concerning Human Understanding.’* in E. A. Burtt, The English 
Philosophers from Bacon to Mill, Modem Library, Inc., New York, 1939. 
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.mmediate past, he assigns an important role to neurophysiological hypothe- 
ses and makes them the central element in his theory-. As part of a general 
neurophysiological theory of behavior, Hebb has suggested a theory of 
learning, and from this we can abstract some of the important ideasln 
cerned with changes in the nervous system. Thus we can see how one psy- 

for -„e 

Hebb begins by pointing out tftat we must have some way of reconciling 
the seemtngl, tncompatibic demands of perception and learning Learning 
apparently requires that s.eafic cells be excited for a specific fesp^n" to 

terns r ‘he other hand, seems to demand that pat- 

terns o exc, ation, rather than specific cells, be the physiological basis for 

psychological events. We know, for example, that a square looks hke a 
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hold the memory of some stimulation until a secondary but permanent con- 
nection has developed sufficiently to take over. The secondary change (the 
one responsible, says Hebb, for everything but events such as the imme- 
diate memory span) is the result of growth of nerve fibers toward one 
another at the synapse, or junction, between the nerve cells. He specifically 
suggests that one cell acts upon another to form knoblike growths that make 
the transmission of impulses through one particular group of cells easier 
than through another. The growth of the fibers itself is the result of activity 
of the nerve cells. Thus, when a particular conjunction of external stimuli 
occurs, they are likely to elicit activity in the same group of cells that was 
previously active. The result is a permanent connection between the ex- 
ternal stimuli and events in the central nervous system. 

The Potentiation Effect and Eccles's Theory 

This second, permanent mechanism for establishing connections between 
cells is very much like a mechanism suggested by the neurophysiologist 
Eccles (1953). Since Eccles derives his notions from direct neurophysio- 
logical evidence, it would appear that we are remarkably close to closing 
the gap between neurophysiology and the psychology of learning. 

Facilitation of one reflex function by the activation of another is a com- 
mon neurophysiological observation. Ordinarily, however, such facilitation 
is limited to the time at which the activation of the second system occurs 
or to a fraction of a second thereafter. Recently, however, neurophysiolo- 
gists (Lloyd, 1950; Eccles and Rail, 1951) have observed a facilitation 
effect which lasts not for a few thousandths of a second but for a good 
many minutes. This effect has been called post-tetanic potentiation. On a 
logarithmic time scale this long-lasting facilitation is well on the way to 
the relatively permanent facilitation of responses to conditioned stimuli 
produced by their being paired with unconditioned stimuli. 

Eccles believes the potentiation effect to occur by a swelling of the end 
knobs at synaptic junctions (this is, of course, similar to Hebb’s notion of 
end-knob growth). Eccles then goes on to develop a theory of conditioning 
built on the mechanism (end-knob swelling) that seems to account for post- 
tetanic potentiation. 

His theory of conditioning concerns the interaction between systems of 
neurons called pathways. These pathways converge on one another in such 
a way that the action of one tends to facilitate action in another. Thus the 
occurrence of excitation in the pathway carrying impulses from the uncon- 
ditioned stimulus facilitates the impulses from the conditioned stimulus so 
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that they carry over to the motor neurons and produce a conditioned re- 
sponse. 

The theory can account for some of the facts of classical conditioning 
(the fact, for example, that the conditioned response is not quite the same 
as the unconditioned response), but it is wide of the mark on many other 
facts. For one thing, the theory clearly predicts that a conditioned response 
should be weakened through disuse, and that, indeed, disuse should be a 
more effective weakening agent than experimental extinction. This is clearly 
contrary to fact, and presents a serious theoretical difficulty. Malmo (1954) 
has suggested that this could be corrected by appealing to a neurophysio- 
logical principle of inhibition (hyperpolarization). Thus it is quite likely 

that additional work may make it possible for the theory to account more 
adequately for the basic facts of conditioning. 

Although the theory does not adequately explain the behavioral facts it 
IS solidly based on neurophysiological concepts, and here lies its importance 
It is not an ad hoc theory put together to account for the behavioral facts 
of conditioning; it is a theory that attempts a genuine integration of neuro- 
physiological concepts and the principles of conditioning. It is possible that 
this represents the first real bridge between behavior and the basic laws 
governing the interaction between neurons in the central nervous system 
At any rate it is difficult to see how the long-lasting facilitation effects found 

in post-tetanic potentiation can be ignored in any future neurophysiological 
theory of conditioning. ° 


Ethology and the Psychology of Learning 

f ‘he most part, have been interested in 

he study of the basic principles of behavior, particularly as they apply to 

he behavior of human beings. Biologists and naturalists, on the other hand 

ave been interested in the behavior of animals and in the relation between 

k3 f r rr This 

of study has come to be known as ethology. Quite recently, the labora 

T behavior, characteristic of the experimental iychologists 

and the field approach, characteristic of the ethologists, have tended to 

come together. In this section we shall deal with some of the relations be 
tween ethology and the psychology of learning. 

Habituation and Imprinting 

We have described the two basic kinds of learning to be classical con 
ditiomng and mstrumental learning. We have, however, restricted our dis- 
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cussion almost entirely to the behavior of mammals. When we consider 

learning in other vertebrates and in invertebrates, the ethologist reminds 

us that there are kinds of learning which do not seem to be easily described 

as either classical or instrumental conditioning. Two other classes of the 

modification of behavior that have been intensively studied by students of 

animal behavior are habituation and imprinting, and we shall consider 
these briefly. 

Habituation. Humphrey (1933) gave us the first clear account of 
habituation and its importance in the animal world. Habituation is learning 
not to respond to stimuli which tend to be without significance in the life 
of the animal (Thorpe, 1956). A familiar example is the description of a 
simple experiment with a snail (Humphrey, 1933). If as the snail moves 
along a board, the board is periodically jerked, there is a reflex withdrawal 
of the tentacles. As the jerking is repeated, however, this withdrawal move- 
ment gradually diminishes and finally stops altogether. If the intensity of 
the stimulus is increased, the response will reappear, only to disappear with 
further repetitions. A period of rest will allow the response to recover spon- 
taneously, but as in the case of the spontaneous recovery of extinction, re- 
peated experimental sessions will finally cause the behavior to disappear 
more or less permanently. This last point is especially important, because it 
is this which differentiates habituation from simple fatigue. 

It is extremely difficult to obtain evidence for associative learning (either 
of the classical conditioning or the instrumental variety) in single-celled 
animals or in very primitive metazoa, such as the coelenterates. On the other 
hand, habituation can be readily demonstrated with every animal (Thorpe, 
1956) and it must be regarded as the most fundamental and elementary 
example of the permanent modification of behavior by repeated stimulation. 

Of course, habituation is very similar to experimental extinction, and it may 
well be that the most elementary examples of extinction are examples of 
habituation of learned rather than unlearned responses. 

Thorpe (1956) points out that habituation is of fundamental impjortancc 
in the natural life of animals. Furthermore, he emphasizes, it is probably 
related to the gradual atrophy of instinctive reactions when they are allowed 
to happen without consummation of the appropriate appetite behind them. 
Thus, both learned and unlearned acts seem to be subject to disappearance 
on repeated elicitation. It is only the activation of the motivation system 
(perhaps centered in the hypothalamus) that keeps responses alive. In other 
words, widespread habituation of behavior may very well be something 
which has to be overcome by the positive effects of motivation. 
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Imprinting. In recent years there has been a good deal of attention 
given to a peculiar variety of associative learning called imprinting. It was 
originally discovered in birds, but it is now suspected to be quite widespread 
m the animal kingdom. The peculiar features of imprinting are as follows; 
(1) it occurs with spontaneous rather than reflex behavior (though it is be- 
havior “released” by external stimulation), (2) it requires no specific rein- 
forcement for activation, and ( 3 ) it seems to occur only during a restricted 
period in the life of very young animals. 

The phenomenon of imprinting first received attention from the work of 
Lorenz (1935), who described a typical observation on imprinting. The 
eggs of a goose are hatched in an incubator and the young are restricted 
to the influence of human beings only; they never see the adults of their own 
species. The result is that the goslings come to follow the human being 
(when he quacks), rather than one of their own species. As we know, gos- 
lings typically follow their mother in a procession. This particular activity 
is not so much the result of the distinctive visual stimuli from the parent 
bird as it is the result of the association of any moving object the bird sees 
wrth the release of the response of following. This imprinting of the young 

bird appears to be limited to very brief periods in its life (Ramsey and 
Hess, 1954). 

It has sometimes been supposed that imprinting is irreversible both in 
the sense that it is never forgotten and in the sense that once a young bird 
IS imprinted to respond to a particular stimulus (say a moving shoe box) 

It cannot be imprinted to other objects. The latter statement is apparently 
untrue; Jaynes (1956) was able to obtain considerable eyidence of general- 
ization of imprinting to objects quite dissimUar to the original imprinted 
stimulus in young chicks. ^ 


j ^ All . new cues to command in- 

nate behavior. Although most of the evidence on imprinung has been ob- 
tained with birds, there is evidence that a similar effect occurs in mammals 
and perhaps in insects (Thorpe, 1956). The great importance of imprint- 
ing in the life of animals is the plasticity, or freedom, in response to stimuli 
hat It gives to certain kinds of instinctive activities. We are still uncertain 
about how it is related to the more universal kind of associative learning 
exemplified by classical conditioning and instrumental behavior UntU we 

know more about it, imprinting remains something of a curiosity in the 
description of learning. ^ 
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Ethology and Learning 

Thorpe ( 1956) in his comprehensive survey of the relationships between 

the mechanisms of learmng and the natural behavior of animals shows how 

particular aspects of learning such as stimulus generalization, latent learn- 

ing, trial and error, etc., fit into the biological pattern of survival in the 

natural world. He demonstrates how the mechanisms of latent learning work 

in exploratory activity and in the learning of topography in animals as primi- 
tive as insects. 

He describes, for example, the dependence upon learning of an instinctive 
bit of behavior, the egg-laying and provisioning behavior of the hunter wasp. 
This animal is able to make several burrows in different places, lay eggs in 
each of them, and return later to provide food for the young. The tendency 
to return is itself determined by innate releasing mechanisms (instinct), 
but the location of the burrows must be learned by the animal. The learning 
is latent because the behavior of returning to the burrow is not always ac- 
companied by direct reinforcement. 

Furthermore, Thorpe was able to demonstrate, by controlled observa- 
tions, that this must be the learning of stimulus relations rather than spe- 
cific responses, for these animals can be picked up and released in strange 
locations and still find their way back to the nest (Thorpe, 1950). Figure 
59 shows some of Thorpe’s tests for detour, or barrier-circumventing, be- 
havior in these wasps as well as the effects of release in a strange location. 
Notice the straight and regular path of the wasp to the burrow following its 
release in new surroundings. 

Thorpe ( 1956) argues that some of the spectacular observations on com- 
munication in bees that have received so much attention recently must indi- 
cate rather complicated latent learning in these animals. Bees can return 
to a hive after locating a source of sugar, and when they return they com- 
municate to other bees the direction and approximate distance of the source. 
The communication is by means of a complicated dance. Dethier (1957) 
has shown that a similar pattern of behavior in the fly can be accounted for 
by direct stimulus-response and physiological mechanisms, and it is very 
likely that the activity of the bee can be accounted for by similar mecha- 
nisms. Thus, spectacular as this behavior is, it is probably not an example 
of learned behavior. 

There is great difficulty in separating the influence of instinct and learn- 
ing in animals. Only now are we beginning to understand how these things 
interact in the natural world. Most animal behavior is a complicated inter- 
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Figure 59 . Shows the route token during detour and displacement experiments on on individual 
wasp. The heavy line shows the course of the wasp. A is the point at which it was first observed. The 
numbers 1 to 4 indicate the points at which metal screens were ploced in the wasp's path to force 
detouring. The insect wos captured at 8, It wos transported over the dotted line ond released at 
point C. The remaining numbers indicate further detour experiments. N is the nest, finally reached 
y the insect. The time taken by the insect was approximately 15 minutes. (Thorpe, 195d.) 


weaving of instinctive and learned components. Once an instinctive act has 

been released, it has opened the way for possible modification. Certain 

kinds of learning depend upon instinctlike mechanisms (imprinting), and 

so it is safe to say that learning is modified by instinct as much as instinct 
is modified by learning. 


LEARNING APPLIED TO SOCIAL AND PSYCHOLOGICAL PROBLEMS 

We have had little to say— and that only incidentaUy— about the appli- 
cation of the psychology of learning to general social and psychological 
problems. More often than not, the worker in the psychology of learning 
has had in mind only remotely, if at all, an eventual application of his work 
to the more pressing problems of human behavior. Nevertheless, the appli- 
cation has gone forward, sometimes wisely and sometimes not A few 
theorists and experimentalists have argued that the justification for the 
theoretical study of learning lies in its application, and some of these psy- 
chologists have enthusiastically applied the experimental evidence and 
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theories of learning to problems in personality development and social 
relations. 

Unfortunately, there is some confusion about the application of learning 
theory, because some ambiguity exists in the meaning of learning theory. 
In the past generation, theories of behavior have come to be identified 
with theories of learning. Thus, we find B. F. Skinner generally classified 
as a learning theorist, though he is very little concerned with learning and 
memory. With considerably more justification he could be called an in- 
centive theorist. We shall try to avoid this identification of learning with 
the general theoretical principles of behavior and try to restrict our discus- 
sion to associative problems. Of course, we shall not be able to ignore en- 
tirely motivational and emotional problems, since these frequently turn out 
to be based on associations between behavior and its consequences. 

The applications of the psychology of learning are numerous, and we 
can discuss only a few representative examples from the fields of personal- 
ity, social development, and educational psychology. These should give us 
some indication of the direction in which applications of learning have gone. 

Problems in Personality and Social Development 

In recent years there have been many attempts to apply the basic data 
and theories of learning to the problems of human personality. One of the 
ear lest is found in Guthrie’s Psychology of Human Conflict ( 1938) . Guthrie 
attempted to account for problems in personality and personal conflict by 
relying almost exclusively on classical conditioning as the model for all 
kinds of behavior, reflex and instrumental, emotional and intellectual. 
Guthrie’s attempt is distinguished by ingenuity and style, but it is more 
of a tour de force than a serious attempt to cope with the complicated prob- 
lems of human behavior in terms of associative learning. 

More recently, learning theorists have attempted to deal with more spe- 
cific problems, such as the course of psychotherapy. Furthermore, these 
attempts, unlike Guthrie’s, make some verifiable predictions about the 
problems with which they deal. In the next few pages we shall examine 

some of these applications of the basic learning principles to the analysis 
of personality and psychotherapy. 

One of the most persistent efforts of the immediate past has been to 
achieve a synthesis of psychoanalytic and learning theories. This has in- 
triguing possibilities, and since, as has often been pointed out, Freud was 
basically an associationist, it has not been so difficult a task as might be 
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supposed. Much of the work we shall discuss has been written from this 
synthetic psychoanalytic-learning point of view. 

Personality and Psychotherapy 

Shoben (1949) suggested that one of the major goals of psychotherapy 
IS the acquisition of symbolic control over behavior. The process of psycho- 
therapy is described by Shoben as the lifting of repression and the develop- 
ment of insight through symbolic reinstatement of the stimuli for anxiety, 
the diminution of anxiety by counter conditioning, and the process of re- 
education with the help of the therapist. Within this framework are many 
problems directly related to the principles of learning. 

Dollard and Miller.^ One of the most convincing attempts to apply the 
principles of learning to personality and psychotherapy is demonstrated in 
Dollard and Miller’s Personality and Psychotherapy (1950). The authors 
write from the point of view of Hull’s general theory of behavior and 
learning, but save for a few points, they could have written their book 
from the point of view of the purely empirical study of learning. Much of 
their book is devoted to an analysis of a case history and the course of 
psychotherapy in this case. This gives their book an extremely realistic 
flavor, but it only begins to suggest the possibilities of the analysis of psy- 
chotherapy from the point of view of the learning process. 

Dollard and Miller devote some space to outlining the basic principles 
of conditioning and learning. They then show how these principles can ac- 
count for the acquisition of fears and anxieties in infancy and childhood 
They argue, for example, that if a baby becomes very hungry when it is 
alone and cries and frets while the hunger drive increases, when it is finally 
fed there wUl be a large-scale reduction of need. These responses of crying. 

retting, etc., are reinforced and thus become the characteristic responses 
of the child when left alone. 

The bulk of their book is devoted to the consideration of psychotherapy 
as a learning process and to an analysis of the case illustrating the role of 
tnechanisms of learning in psychotherapy. Dollard and Miller start out by 
characterizing the neurotic person as someone who learns inadequate ways 
of thinking and dealing with himself. The inability to think about his basic 
problems may make the neurotic appear to be stupid; the pressures of con- 
flict make him feel miserable. Neurotic activities and psychosomatic symp- 
toms are the ways in which he has learned to alleviate his misery partially. 

3 See the section in Chap. 6 on DoIIard and Miller. 
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The business of psychotherapy, according to Dollard and Miller, is to 
correct the faulty habits the neurotic has learned. The therapist sets up a 
warm and permissive atmosphere in which the neurotic’s anxieties and fears 
can be symbolically introduced and allowed to extinguish. The therapist 
arouses memories of anxiety-inducing events in the neurotic’s life history, 
but it is done in the permissive atmosphere of therapy, so that these fears 
and anxieties are unreinforced and may therefore be extinguished. 

One of the problems for the therapist is to attach labels to things which 
the neurotic has lost the ability to deal with mentally. Words and sentences 
are attached to emotions formerly repressed. The neurotic can then deal 
more adequately with his inner feelings and experiences and he is made 
capable of dealing with his conflicts at an overt, conscious level. This is 
essentially the process of discrimination, and these two processes, extinc- 
tion of fear and anxiety and the development of discrimination, are the 
essential contributions of psychotherapy. 

Dollard and his associates (Dollard, Auld, and White, 1953) have done 
more than simply present the process of psychotherapy as interpreted by 
learning theory; they have suggested techniques for the evaluation of ther- 
apy and the analysis of its pfdgress based upon this interpretation. Thus, 
such an approach to psychotherapy is of considerable potential value to the 
practical clinical psychologist. 

Mowrer. A similar analysis of personality disorder and psychotherapy 
has been presented by Mowrer.^ More recently, however, Mowrer has ex- 
panded his point of view to encompass a vast multitude of scientific and 
moral problems in the development of personality and psychotherapy. He 
has considerably broadened the concepts of learning to be applied to the 
analysis and description of personality. He makes much more use of a 
concept of mediational activity (or “pure stimulus act,” as Hull has called 
it) than do Dollard and Miller. For example, he sees thinking as a form of 
activity in which the organism makes symbolic responses to find out what 
effects will occur if the action is really carried out (Mowrer, 1954). 
Thought, says Mowrer, is preparation for action, fantasy is substitution for 
action. Therefore, although he finds a role for the habit processes much 
like those suggested by Shoben and by DoUard and Miller, he finds the be- 
havior of the individual in psychotherapy much more complicated than 
learning theory allows. 

Indeed, learning theory does offer a most mechanical and limited model 
of the individual in difficulty. Since some model is better than none at all, 

4 See, for example, Mowrer (1950) 
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Mowrer and others of his general viewpoint run the danger of losing the 

The^Zt ^‘^hematize 

ood th ^ development. It is perhaps 

good that learning models of personality action and psychotherapy are 

ra er simple, for they encourage us to concentrate on solving a few ele- 

"Zr 7 Z'’ personality, on 

the other hand so overwhelm us with the complexities of the probleL to 

tribu^io '"■‘h that they discourage investigation. Thus, one of the real con- 

t"n ZZZs" 

l.earning and Socialization 

Children must learn to become adequate adults in the different societies 
to which they belong. Societies differ from one another considerably in the 
extent and kinds of demands they make upon individuals. Consequently 

tamT r socialization in different societies present an impor- 

tant challenge to the psychology of learning ^ 

find^wer^'Z; expect to 

1 worked out theoretical applications of the principles of learnine 

The work of Whit, ng and Child (1953) has suggested ways of studying the 

mfluence of infant and child care on the characteristics of adults in Jifffrent 

Child (1954) points out that the learning of infants and children is more 

childre^ generalized than that of older individuals. This is because young 
children are preverbal and thus do not have the mediating responses for ner 

forming fine discriminations. Such behavior is less susceptible to extincLn 
and verba control. Secondly, Child points out, the infant is helpless and 

lion'll' aT" subjected to drives of great intensity. Therefore, drive reduc- 
on IS a basic and powerful element in the control of behavior of verv 
y ung infants. Thus the stage is set for the learning of powerful emnf \ 

ieZmZZh -h:;:~L:Zn"tZtZ^^^ 

ZvTduaTr’ thZpersonry ol 

cipleTZZ roZof derivation from basic learning prin- 

those of the parent. Seward deduces that the kind of idp t fi’ 

oped by punishment will be closely linked to the kinds of behaZr acZfiy 
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punished, while approval will tend to produce identification with much 
wider generalization. 

The role of reinforcement is only one of the many mechanisms that 
work in the kind of socialization that leads to identification, but it is an 
important one. Notions like Seward’s have implications for a cross-cul- 
tural analysis of socialization, for it is possible to examine cultures which 
differ in the way in which identification is induced to see what differences 
are produced in the kinds of identification achieved. 

Learning and Theories of Personality 

It is encouraging to see that the general concepts which have arisen 
from the study of learning are now being incorporated into the body of 
personality theory. This is partly because of the close parallel between cer- 
tain classical psychoanalytic notions and some basic concepts in learning. 
Over and above this, however, there has been a genuinely new appreciation 
of the value of some of the familiar ideas in the study of learning for the 
understanding of personality development. 

Rotter (1954), for example, has presented a rather detailed theory of 
personality development from the point of view of social learning. He adds 
a new dimension to the understanding of personality by means of a concept 
of reinforcement value. He reminds us that not only are different reinforce- 
ments effective with different people, but that for any individual or group 
of individuals there is an organized hierarchy of values of reinforcement. 
Furthermore, these hierarchies evolve with experience, and this kind of 
change is one aspect of personality development. 

Rotter shows that the reinforcement value of a certain reward is to a 
considerable extent determined by the individual’s anticipation of the pat- 
tern or frequency of future reward. This notion allows Rotter to reinterpret 
the role of delay of reinforcement in maladjustive behavior. Various psy- 
chologists have supposed that maladjusted individuals, criminals, and neu- 
rotics are marked by an inability to act on the basis of delayed reinforce- 
ments; such people seek immediate gratification. Thus, says Rotter, if we 
were to offer a group of children the choice of a penny candy today or a 
5 cent candy tomorrow, at least some of the children would choose the 
penny candy today. From Rotter’s point of view, such a choice is not be- 
cause the delayed reinforcement has a lesser value, but because some indi- 
viduals have learned that reinforcements promised for the future are not 
likely to occur. 

Rotter and other theorists have proposed a large number of mechanisms 
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derived from basic studies of learning to account for various aspects of per- 
sonality development. These concepts should have the effect of leading to 
ruitful experimental investigation of personality mechanisms. 

Problems in Education 

At the beginning of the twentieth century there was much hope that the 

experimental study of learning would lead, in a fairly short time, to a scien- 

t.hc analysis of the problems of school and teaching. Many early books on 

learning were greatly concerned with the applications of existing facts to 

problems in education. In a very real sense, some valuable contributions 

were made to the practice of education. The early studies of transfer, par- 

icu arly those of Thorndike and Woodworth (1901 ), were influential in the 

decline ot the notion of formal discipline. All in all, however, the experi- 

mental study of learning did not live up to the early hopes for application 

hat aroused. By 1930, workers in this field disclaimed any attempt to 

achieve results which would have educational significance. At the same 

time a great interest in theories of learning arose, and these were mostly 
concerned with the behavior of animals. 

Early Problems in Educational Psychology 

The influence of E. L. Thorndike sat heavily upon educational psychol- 
gy for many years. Thorndike’s particular interpretation of learning— 

JIamt/t h' the core for the account of 

learning to be found m early textbooks on educational psychology. The 

principle of effect and trial-and-error behavior were the two central themes 
in early theories of learning. 

There is nothing wrong with an account of reinforcement and trial-and- 
error processes in a textbook in educational psychology, except that these 
things by thernselves do not make a complete and convincing pictured 
he learning of children. One of the outcomes of the disappoiLment with 
the classical Thorndikian point of view was that a formal diussion of the 
psycho ogy of learning all but disappeared from accounts of educational 
psychology^ Theories of learning contained so little of interest to educators 

cholo“ f H "" extremely minor role in the psy- 

chology of education. Since theories of learning had little or nothing of 

practica value to contribute, their place in the textbooks was taken by Lw 

material on personality development, mental hygiene etc ^ 

remained. This was 

y an account of the conditions which promote efficient classroom 
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learning. The textbooks discussed massed and distributed practice, the effect 
of recitation on memory, and motives in classroom learning. There was 
little of theoretical interest discussed; the effect of recitation on memory 
was simply presented as a fact, and its interpretation in terms of recoding 
and transfer was ignored. 

Transfer of training (although not always Thorndike’s concept of iden- 
tical elements) remained as the most basic contribution of learning to edu- 
cational psychology. This emphasis gradually changed, however. The ear- 
lier use of transfer was to further the social-utility movement in education. 
The apparent lack of transfer from the more traditional school subjects to 
the real and important concerns of human life was sufficient reason for de- 
creasing the emphasis upon these subjects. The curriculum was revised so 
that the traditional subjects would be more likely to transfer to the prob- 
lems of daily living. 

More recently, we have come to realize that maximizing transfer is some- 
times less important than other considerations. It is perhaps more valuable 
to produce a limited amount of transfer from some kind of activity that 
can go on only in the schools than to produce a lot of transfer that dupli- 
cates the activity of other agencies of the community. Consequently, the 
emphasis has changed from one of revising the curriculum to maximize 
transfer to one of maximizing transfer within the framework of a curricu- 
lum determined by the needs of the schools. 

Current Problems in Educational Psychology 

Much of educational psychology today is concerned with problems of 
personality development and individuality. Since there has been a growing 
interest in the application of theories of learning to individual development 
and socialization, much of the fundamental theoretical and experimental 
work in the psychology of learning is now returning to educational psy- 
chology, albeit with a different emphasis. 

In addition, of course, there have been many developments within edu- 
cational psychology itself. The dissatisfaction with the traditional material 
in learning and the conffict between the experimentalists, who insist upon 
working with pure variables, and the educators, who find that this produces 
little of direct application, are still with us. A few individuals, however, 
have been working at bringing together the experimental approach to learn- 
ing and the problems of classroom learning. Auble and Mech (1953), for 
example, deal with the application of partial reinforcement to behavior in 
the classroom. Their results suggest that reinforcement variables must be 
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treated with extreme caution in applied problems, since children are able 
to mediate their own reinforcements. The influence of intraverbal processes 
in children, the nature of the social situation in the classroom, and other 
conditions make the application of simple variables to classroom learning 
extremely difficult. The current attention of many experimentalists in class- 
room learning is now directed toward the study of the interaction between 
social and personality variables and the variables that control learning. 

Despite the many difficulties, Skinner (1954) has recommended a 
straightforward application of the reinforcement technique for the control 
of behavior to the classroom. He asks. What reinforcements are available 
for working with the child in school? The answer is in all of the things— 
scissors, puzzles, noise makers— which feed back to the child changes in the 
environment and which are free of aversive, or negatively reinforcing, prop- 
erties. Skinner argues that in the schoolroom today much of the positive 
reinforcement from sheer control of the environment is masked by the 
emotional responses generated by aversive conditions. 

Skinner even goes so far as to recommend automatic devices for the con- 
trol of reinforcement contingencies, since, he argues, the personal control 
of reinforcement by a teacher is too erratic and irregular. The free use of such 

evices, says Skinner, would free the teacher to do what she is best suited for 
deal with the socialization of the individual child. 


Although Skinner’s suggestions have outraged many people, some have 
been impressed. The kinds of techniques he suggests for the control of be- 
havior (and hence of learning) are now undergoing systematic testing In 
the next few years we should be able to evaluate them properly. 

Recent years have seen little in the way of large-scale theories of educa- 
tional psychology. Perhaps Skinner’s efforts will' point in this direction In 
a somewhat different direction, Stephens (1951, 1956) has advanced a 

1 f are many basic 

nondeliberative (one might almost say instinctive) factors underlying the 

e ucative process, and these, more than any particular method or practice 

in education, are the things that determine how schooling happens. 

Stephens started out originally with a hypothesis to account for the large 
iterature of rtegative findings in the field of experimental comparison of 
the different methods” of teaching. These appear to make little or no 
d fference in the outcome of education. This, Stephens argues, is so because 

o r f T T some 

m^itt^irs ^1 s ^?1 fl ” * * He argues that such 

matters as play, discursive conversation, tolerance of unusually communica- 
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Live people, and the fundamental operations of reinforcement all work no 

matter what the specific method. The method supplies direction and empha- 
sis, but the fundamental work is done by the nondeliberative factors. In- 
deed, these factors are responsible for the institution of schooling itself. 

This is radically different from the kind of theory currently predominant 
in psychological and sociological discussions of education, but it makes 
much of our knowledge of the pervasiveness of learning as an evolutionary 
mechanism and of the innately reinforcing properties of nonappetitive re- 
wards, which have received much attention lately. The introduction of such 
a theory seems to indicate a reawakening of interest in the fundamental 
connection between learning and educational theories. 

IN CONCLUSION 

In this short book we have followed a complicated path through the ex- 
perimental literature on learning. It is disturbing to think that many read- 
ers will be so lost in the details that they will fail to see the shape of things 
as a whole. In these last few paragraphs we shall attempt to give an over-all 
contour to what we have said. 

We have looked upon learning as a primarily associative process. The 
fundamental variables that control learning are temporal contiguity, simi- 
larity (both among responses, implicit and explicit, and among stimuli), 
repetitions of stimuli and responses, interaction between learned responses, 
and learned mediating responses. We have seen that the motive-incentive 
variables such as drive, emotion, and reinforcement (including time and 
pattern of reinforcement) determine the behavior of organisms. In so far 
as orgamsms learn what they do, these variables determine learning. Fur- 
thermore, a fundamental class of associations in the animal world is that 
of responses with reinforcing agencies. We have seen, however, that some- 
times when organisms behave without specifically channeled motives or 
well-defined reinforcements they still learn. Thus, these motive-incentive 
conditions which guide and control behavior are not essential to learning. 
The essential features of learning are associations among stimuli and among 
stimuli and responses as these occur in the experience of the organism. 

The variables determining learning interact with species and individual 
differences among organisms, and they are limited by the capacities of the ^ 
organisms. We have seen the limitation that the immediate memory span 
imposes on human memory and how this limitation is circumvented by repe- 
tition and recoding. We have seen how intraorganic symbolic processes are 
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necessary for certain kinds of “insightful” problem solving and how the 

lack of such processes produces an inevitable limit to the problem-solving 
ability of certain animals. ® 

We hav-e also discussed the role of language in recoding man’s experi- 
ence. We have had httle to say about the learning of linguistic habits them- 
selves. Such habits are, to some extent at least, associative; they are associa- 
tive m the sense that the learning of nonsense syllables provides a convenient 
and crude model for the development of such habits. Linguistic habits how- 
ever, are more than simply chains of probabilistic associations. Human lan- 
guage has a grammar whose existence makes the learning of natural linguistic 
abits very different from the learning of non.sense syllables. 

Grammar is not purely associationistic. There are certain elementary 
orms in grammar into which we can place words, ideas, even new words 
and new ideas that have never been used before. We can easily recognize 
he elements of the common forms of our mother tongue, even though non- 
nse may e put into those forms. “Gelix frimmaged a leble.” This can be 
Identified by most intelligent users of English as a simple, declarative sen- 
tence It ,s not particularly difficult to identify the subject, the verb the 
object, and the syntax in which these appear. Part of our learning of lln 
guage IS learning the sequential and semantic associations that are fhe com- 
mon property of the community, but another part is the learning of linguis- 
tic forms into which we can place our words. 

We have had little or nothing to say about how such grammatical forms 
are lea™d and for ,h= ,e„ good ,e.„„ ,ha, one oJThe ^ 2 - 

h re, Ig few v.rgina, ferri.ories let, tor research in verbal learning, and U 
to be recommended to anyone who wishes to make a fundamental con 
trffiution to ffie understanding of the human mind and its verbal processes 

procet”^’ h sequential and mediational aLciative 

processes we have made in this book needs some correction. These associa 

aE^ir "^so'^iations imposed within grammatical forms are 

In 3 h " '"r '"‘‘h them. 

our discussion of verbal learning we had little to say about meaning 

ffice It to say that the prevailing view in psycholinguistics today is thft 
meaning has its roots in association. The meaning of a word TZ 

offier\"e'‘b^r''S"' by the associations of ihat unit'wiffi 

Mea • nonverbal responses and with external stimuli generally 

nmgs, mdeed, seem frequently to be little more than the bridge between 
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one associative cluster and another. Thus, the general question of meaning 

is likely to get lost in the details of recoding, associative clustering, and 
stimulus generalization. 

There are many points at which the study of learning crosses the other 
disciplines concerned with man’s mind and its works. Language, neuro- 
physiology, personality development, and social anthropology — all cross in 
different ways. If we have achieved some understanding of the way in 
which organisms learn, we shall carry to these other disciplines something 
of fundamental importance. 

One basic idea that has come out of the systematic study of learning is 
the attitude that some of the complexities of human life and society can 
be understood in terms of the operation of basic learning laws. If the impli- 
cations of this idea have begun to stir, we will have accomplished something. 

Always, of course, a discipline such as the psychology of learning prom- 
ises more than it can achieve. It leaves a model of the organism that is 
stiff, mechanical, and only a faint copy of reality. It is, however, better 
than no model at all, and if we are to understand the behavior of organisms 
in an abstract, scientific sense we badly need such a model. The course of 
future work in learning and in the basic theory of behavior that goes along 
with it will be in the direction of introducing some flexibility and life into 
the model. At the same time, perhaps paradoxically, the model will proba- 
bly become more abstract. In other words, certain concepts we now use, 
which are crude physical models, will tend to become more and more 
abstract and mathematical in character. This dual process of introducing 
flexibility and abstractness into the behavioral model of organisms appears 
to be the fundamental task of future research in the psychology of learning. 

It also is beginning to appear more likely in fact, rather than merely in 
hope, that the psychology of learning will begin to approach a unity of con- 
cepts with other disciplines. Thus the fundamental associative concepts of 
the future will probably be stated in such a way that they have some neuro- 
physiological as well as behavioral validity. All this is in the future, how- 
ever, and at present our horizons are limited by our ability to see only the 
dimmest connections between learning and the nervous system and between 
learning and man’s best intellectual efforts. 

What has gone into this book is a short account of the present status of 
the psychology of learning. Like all healthy, growing scientific efforts, it is 
changing continuously. Some of the gaping deficiencies and points of ob- 
scurity and downright error will soon be corrected, but it is unlikely that 
the main outlines will change very much in the near future. 
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